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PATENT 

Attorney Docket No.: 17682A-003630US 
Client Reference No.: 152C 

Modified Oligonucleotides for Mismatch Discrimination 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims the benefit of Provisional application Ser. No. 60/186,046, 
filed March 1, 2000; and is a continuation-in-part of 09/724,959, filed November 28, 2000; and is a 
continuation-in-part of U.S. Application No. 09/640,953, filed August 16, 2000; which is a 
continuation of U.S. Application Ser. No. 09/054,832, filed April 3, 1998; and is a continuation-in- 
part of U.S. Application Ser. No. 09/431,485, filed November 1, 1999; which is a continuation of U.S. 
Application Ser. No. 09/054,830, filed April 3, 1998, now U.S. Patent No. 6,127,121; the disclosures 
of each being incorporated herein by reference. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

Not applicable. 

BACKGROUND OF THE INVENTION 

This application is in the field of molecular biology relating to the use of 
oligonucleotides as probes and primers in liquid, solid and mixed phase assays. The 
application further relates to the use of modified nucleic acid bases and modified 
oligonucleotides to improve the hybridization properties and discriminatory abilities of 
oligonucleotides that are used in arrays and as probes and primers. 

Many techniques currently in use in molecular biology utilize oligonucleotides 
as probes and/or primers. It is often advantageous, in the practice of these techniques, to be 
able to distinguish between two or more sequences which are related but which differ by one 
or more nucleotides. For example, many mutations of clinical significance differ by only a 
single nucleotide from the wild-type sequence. Polymorphisms in mammalian genomes are 
also often characterized by sequence differences of one or a few nucleotides. The ability to 
make such a distinction is known as mismatch discrimination. In practical terms, mismatch 
discrimination describes the property by which a defined sequence oligonucleotide, at a given 
stringency, hybridizes strongly (one manifestation of which is that the hybrids have a high 
melting temperature) to a target sequence with which it is complementary along its entire 
length (a perfect hybrid or perfect match), but hybridizes detectably more weakly to a target 



sequence that is non-complementary to the sequence of the oligonucleotide at one or a few 
nucleotides (a mismatch). The differences in hybridization strength are such that a particular 
stringency can be selected at which a perfect match is detectable as a hybrid and a mismatch 
fails to form a hybrid. 

5 In a nucleic acid duplex, each base pair contributes to stability. Hence, the 

shorter the duplex, the greater the relative contribution of each individual base pair to the 
stability of the duplex. As a result, the difference in stability between a perfect match and a 
mismatch will be greater for shorter oligonucleotides. However, short oligonucleotides 
hybridize weakly, even to a perfectly complementary sequence, and thus must be hybridized 

10 under conditions of reduced stringency. Thus, the potential discriminatory power of short 
oligonucleotides cannot be easily realized except under conditions of low stringency. 

What is needed in the art are new methods for mismatch discrimination, 
particularly for single-nucleotide mismatches, under conditions of high stringency; for 
example, at the elevated temperatures characteristic of most nucleic acid amplification 

15 reactions. Surprisingly, the present invention provides such methods, along with new 
reagents and compositions which can be used in the methods. 

SUMMARY OF THE INVENTION 

20 The present invention provides a number of modified oligonucleotides found 

to have exceptional properties and usefulness in a variety of assays. Accordingly, the present 
invention also provides methods for using the modified oligonucleotides described herein. 

In one aspect, the present invention provides modified oligonucleotides having 
at least two bases selected from unsubstituted and 3-substituted pyrazolo[3,4-d]pyrimidine 

25 bases. In preferred embodiments, the oligonucleotides having modified bases will further 
comprise other moieties such as detectable labels, fluorescence and chemiluminescence 
quenchers and/or minor groove binders and/or other types of modified bases or base analogs. 

In another aspect, the present invention provides modified oligonucleotides 
having at least one 5 -substituted pyrimidine base and at least one 3-substituted pyrazolo[3,4- 

30 d]pyrimidine base. In preferred embodiments, these modified oligonucleotides will further 
comprise other moieties (as above) such as detectable labels, fluorescence and 
chemiluminescence quenchers and/or minor groove binders. 

In yet another aspect, the present invention provides methods for 
distinguishing polynucleotides having related sequences. 
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In still another aspect, the present invention provides methods for detecting the 
presence of a target sequence in a polynucleotide. 

In still other aspects, the present invention provides methods for primer 
extension, and methods for determining the nucleotide sequence of a polynucleotide. 
5 In related aspects, the present invention provides methods for examining gene 

expression in a cell, and methods for identifying a mutation or polymorphism in a target 
sequence of a gene of interest. 

In still another aspect, the present invention provides a number of modified 
bases that are useful in preparing modified oligonucleotides for the methods described herein 
10 and other conventional assays and methods. 

In yet another aspect, the present invention provides modified oligonucleotide 
arrays wherein the array members have T m s within about 1-2 °C and lengths within 1-2 bases 
from each other. Methods are also provided for determining sequences of the array members. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A and IB provide structures for several modified bases and their 
abbreviations. The wavy line is used to denote the position of an attached sugar moiety 
(unprotected, protected, activated and the like). 
20 Figure 2 is a graph illustrating the balancing of the T m of 8-mer GC-rich and 

AT-rich probes with different combinations of MGB, PPPA and PU. 

Figure 3 is a chart which illustrates one advantage achieved through the use of 
PPPA and PPG in MGB-modified oligonucleotide probes. As seen in the figure, the 
modified bases allow shortening of the probe that shows increased mismatch discrimination 
25 in real-time PCR. A is PPPA and G is PPG. 

Figure 4 illustrates an Invader™ assay in which the modified oligonucleotides 
of the invention can be used. 

Figure 5 illustrates a comparison of Invader™ probe performance with 
different numbers of PPGs. 
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DESCRIPTION OF THE INVENTION 



Abbreviations and Definitions 

5 Abbreviations for a number of the modified bases described herein are 

provided as follows (structures of these bases are shown in Figures 1 A and IB): 6-amino-3- 
prop-l-ynyl-5-hydropyrazolo[3,4-d]pyrimidine-4-one, PPPG; 6-amino-3-(3-hydroxyprop-l- 
yny)l-5-hydropyrazolo[3,4-d]pyrimidine-4-one, HOPPPG; 6-amino-3-(3-aminoprop-l-ynyl)- 
5-hydropyrazolo[3,4-d]pyrimidine-4-one, NH 2 PPPG; 4-amino-3-(prop-l-ynyl)pyrazolo[3 5 4- 

10 d]pyrimidine, PPPA; 4-amino-3-(3-hydroxyprop-l-ynyl)pyrazolo[3,4-d]pyrimidine, 

HOPPPA; 4-amino-3-(3-aminoprop--l-ynyl)pyrazolo[3,4-d]pyrimidine 5 NH 2 PPPA; 3-prop-l- 
ynylpyrazolo[3,4-d]pyrimidine-4,6-diamino, (NH 2 )2PPPA; 2-(4,6-diaminopyrazolo[3,4- 
d]pyrimidin-3-yl)ethyn-l-ol, (NH 2 ) 2 PPPAOH; 3-(2-aminoethynyl)pyrazolo[3,4- 
d]pyrimidine-4,6-diamine, (NH 2 ) 2 PPPANH 2 ; 5-prop-l-ynyl-l,3-dihydropyrimidine-2,4- . 

15 dione, PU; 5-(3-hydroxyprop-l-ynyl)-l,3-dihydropyrimidine-2,4-dione, HOPU; 6-amino-5- 
prop- 1 -ynyl-3-dihydropyrimidine-2-one, PC; 6-amino-5-(3 -hydroxyprop- 1 -yny)-l,3- 
dihydropyrimidine-2-one, HOPC; and 6-amino-5-(3-aminoprop-l-yny)-l,3- 
dihydropyrimidine-2-one, NH 2 PC; 5-[4-amino-3-(3-methoxyprop-l-ynyl)pyrazol[3,4- 
d]pyrimidinyl]-2-(hydroxymethyl)oxolan-3-ol, CH 3 OPPPA; 6-amino-l-[4-hydroxy-5- 

20 (hydroxymethyl)oxolan-2-yl]-3-(3-methoxyprop-l-ynyl)-5-hydropyrazolo[3,4 

one, CH3OPPPG; 5-(4-hydroxybuM-ynyl)-l,3-dihydropyrimidine-2,4-dione, HOBuU; 6- 
amino-5-(4-hydroxybut-l-ynyl)-3-hydropyrimidine-2-one, HOBuC; 4-(4-aminopyrazolo[3,4- 
d]pyrimidin-3-yl)but-3-yn-l-ol, HOBuPPA; 6-amino-3-(4-hydroxybut-l-ynyl)pyrazolo[3,4- 
d]pyrimidin-4-ol, HOBuPPG; 4-(4,6-diaminopyrazolo[3,4-d]pyrimidin-3-yl)but-3-yn-l-ol, 

25 (NH 2 ) 2 BuPPAOR 

Unless otherwise stated, the following terms used in the specification and 
claims have the meanings given below: 

The term "alky!" refers to a linear, branched, or cyclic saturated monovalent 
hydrocarbon radical or a combination of cyclic and linear or branched saturated monovalent 

30 hydrocarbon radicals having the number of carbon atoms indicated in the prefix. For 
example, (Ci-C 8 )alkyl is meant to include methyl, ethyl, ^-propyl, 2-propyl, f erf-butyl, 
pentyl, cyclopentyl, cyclopropylmethyl and the like. For each of the definitions herein (e.g., 
alkyl, alkenyl, alkoxy, aralkyloxy), when a prefix is not included to indicate the number of 
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main chain carbon atoms in an alkyl portion, the radical or portion thereof will have eight or 

fewer main chain carbon atoms. 

The term "alkylene" means a linear saturated divalent hydrocarbon radical or a 

branched saturated divalent hydrocarbon radical having the number of carbon atoms 
5 indicated in the prefix. For example, (Ci-C 6 )al'kylene is meant to include methylene, 

ethylene, propylene, 2-methylpropylene, pentylene, and the like. 

The term "alkenyl" refers to a linear monovalent hydrocarbon radical or a 

branched monovalent hydrocarbon radical having the number of carbon atoms indicated in 

the prefix and containing at least one double bond. For example, (C 2 -C 6 )alkenyl is meant to 
10 include, ethenyl, propenyl, and the like. 

The term "alkynyl" refers to a linear monovalent hydrocarbon radical or a 

branched monovalent hydrocarbon radical containing at least one triple bond and having the 

number of carbon atoms indicated in the prefix. For example, (C 2 -C 6 )alkynyl is meant to 

include ethynyl, propynyl, and the like. 
1 5 The terms "alkoxy," "alkylamino" and "alkylthio" (or thioalkoxy) are used in 

their conventional sense, and refer to those alkyl groups attached to the remainder of the 

molecule via an oxygen atom, an amino group, or a sulfur atom, respectively. Similarly, the 

term dialkylamino refers to an amino group having two attached alkyl groups that can be the 

same or different. 

20 The term "aryl" means a monovalent monocyclic or bicyclic aromatic 

hydrocarbon radical of 6 to 10 ring atoms which is unsubstituted or substituted independently 
with one to four substituents, preferably one, two, or three substituents selected from alkyl, 
cycloalkyl, cycloalkyl-alkyl, halo, nitro, cyano, hydroxy, alkoxy, amino, acylamino, mono- 
alkylamino, di-alkylamino, haloalkyl, haloalkoxy, heteroalkyl, COR (where R is hydrogen, 

25 alkyl, cycloalkyl, cycloalkyl-alkyl, phenyl or phenylalkyl), -(CR'R") n -COOR (where n is an 
integer from 0 to 5, R' and R" are independently hydrogen or alkyl, and R is hydrogen, alkyl, 
cycloalkyl, cycloalkylalkyl, phenyl or phenylalkyl) or -(CR'R") n -CONR a Rb (where n is an 
integer from 0 to 5, R' and R" are independently hydrogen or alkyl, and R a and R b are, 
independently of each other, hydrogen, alkyl, cycloalkyl, cycloalkylalkyl, phenyl or 

30 phenylalkyl). More specifically the term aryl includes, but is not limited to, phenyl, biphenyl, 
1-naphthyl, and 2-naphthyl, and the substituted forms thereof. Similarly, the term 
"heteroaryl" refers to those aryl groups wherein one or more heteroatoms or heteroatom 
functional groups have replaced a ring carbon, while retaining aromatic properties, e.g., 
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pyridyl, quinolinyl, quinazolinyl, thienyl, and the like. For brevity, the term aryl, when used 
in combination with other radicals (e.g., aryloxy, arylalkyl) is meant to include both aryl 
groups and heteroaryl groups as described above. 

The term "arylalkyl" refers to a radical -R a R b where R a is an alkylene group 

5 (having the indicated number of carbon atoms, or if unspecified having six or fewer main 
chain carbon atoms) and R° is an aryl group as defined herein. Examples of arylalkyl groups 
include benzyl, phenylethyl, 3-(3-chlorophenyl)-2-methylpentyl, and the like. 

Similarly the term "arylalkenyl" means a radical -R a R b where R a is an 
alkenylene group and R° is an aryl group as defined herein, e.g., 3-phenyl-2-propenyl, and 

10 the like. 

"Arylheteroalkyf" means a radical -R a R b where R a is an hetero alkylene group 
(having the indicated number of carbon atoms) and R b is an aryl group as defined herein, 
e.g., 2-hydroxy-2-phenyl-ethyl, 2-hydroxy-l-hydroxymethyl-2-phenyl-ethyl, and the like. 

The. term "aryloxy", refers to a radical -OR where R is an aryl group, e.g., 

15 phenoxy, naphthyloxy and the like. 

The prefix "halo" and the term "halogen" when used to describe a substituent, 

refer to -F, -CI, -Br and -I. 

The term "heteroalkyl" refers to an alkyl radical as defined herein with one, 
two or three substituents independently selected from cyano, -OR a , -NR b R c , and -S(0) n R d 
20 (where n is an integer from 0 to 2 ), with the understanding that the point of attachment of the 
heteroalkyl radical is through a carbon atom of the heteroalkyl radical. R a is hydrogen, alkyl, 
aryl, arylalkyl, alkoxycarbonyl, aryloxycarbonyl, carboxamido, or mono- or di- 
alkylcarbamoyl. R b is hydrogen, alkyl, aryl or arylalkyl. R c is hydrogen, alkyl, aryl, 
arylalkyl, alkoxycarbonyl, aryloxycarbonyl, carboxamido, mono- or di-alkylcarbamoyl or 

25 alkylsulfonyl. R d is hydrogen (provided that n is 0), alkyl, aryl, arylalkyl, amino, mono- 

alkylamino, di-alkylamino, or hydroxyalkyl. Representative examples include, for example, 
2-hydroxyethyl, 2,3-dihydroxypropyl, 2-methoxyethyl, benzyloxymethyl, 2-cyanoethyl, and 
2-methylsulfonyl-ethyl. For each of the above, R a , R b ,R C , and R d can be further substituted 
by NH 2 . fluorine, alkylamino. di-alkylamino, OH or alkoxy. Additionally, the prefix 

30 indicating the number of carbon atoms (e.g., C,-C, 0 ) refers to the total number of carbon 
atoms in the portion of the heteroalkyl group exclusive of the cyano, -OR a , -NR b R c , or 
-S(0) n R d portions. 
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The term "heterocyclyl" refers to a saturated or unsaturated non-aromatic 
cyclic radical of 3 to 8 ring atoms in which one or two ring atoms are heteroatoms selected 
from O, NR (where R is independently hydrogen or alkyl) or S(0) n (where n is an integer 
from 0 to 2), the remaining ring atoms being C, where one or two C atoms may optionally be 
5 replaced by a carbonyl group. The heterocyclyl ring may be optionally substituted 

independently with one, two, or three substituents selected from alkyl, halo, nitro, cyano, 
hydroxy, alkoxy, amino, mono-alkylamino, di-alkylamino, haloalkyl, haloalkoxy, -COR 
(where R is hydrogen, alkyl, cycloalkyl, cycloalkyl-alkyl, phenyl or phenylalkyl), -(CR'R") n - 
COOR (n is an integer from 0 to 5, R' and R" are independently hydrogen or alkyl, and R is 
10 hydrogen, alkyl, cycloalkyl, cycloalkyl-alkyl, phenyl or phenylalkyl), or -(CR'R") n - 

CONR a R b (where n is an integer from 0 to 5, R' and R" are independently hydrogen or alkyl, 
and R a and R b are, independently of each other, hydrogen, alkyl, phenyl or phenylalkyl). 
More specifically the term heterocyclyl includes, but is not limited to, tetrahydropyranyl, 
piperidino, N-methylpiperidin-3-yl, piperazino, N-methylpyrrolidin-3-yl, 3-pyrrolidino, 2- 
15 pyrrolidon-l-yl, morpholino, thiomorpholino, thiomorpholino-1 -oxide, thiomorpholino-1,1- 
dioxide, pyrrolidinyl, and the derivatives thereof The prefix indicating the number of carbon 
atoms (e.g., C3-C10) refers to the total number of carbon atoms in the portion of the 
heterocyclyl group exclusive of the number of heteroatoms. 

The terms ,, heterocyclylalkyl, ,, "heterocyclylalkenyl," "heterocyclylalkynyl" 

20 refer to radicals -R a R b where R a is an alkylene, alkenylene or alkynylene group, 

respectively, and R b is a heterocyclyl group as defined herein, e.g., tetrahydropyran-2- 
ylmethyl, 4-methylpiperazin-l-ylethyl, 3-piperidinylmethyl, and the like. 

The terms "heteroalkylene" means a linear saturated divalent hydrocarbon 
radical of one to six carbons or a branched saturated hydrocarbon radical of three to six 

25 carbon atoms with one, two or three substituents independently selected from -OR a , -NR b R c , 
and -S(0) n R d (where n is an integer from 0 to 2 ) where, R a , Rb, Rc, and R d are as defined 
herein for a heteroalkyl radical. Examples include, 2-hydroxyethan-l,2-diyl, 2- 
hydroxypropan-l,3-diyl and the like. 

Each of the above terms {e.g., "alkyl," "heteroalkyl," and "aryl") are meant to 

30 include both substituted and unsubstituted forms of the indicated radical. Preferred 
substituents for each type of radical are provided below. 
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Substituents for the alkyl and heteroalkyl radicals (including those groups 
often referred to as alkylene, alkenyl, heteroalkylene, heteroalkenyl, alkynyl, 
heterocycloalkyl, and heterocycloalkenyl) can be a variety of groups selected from: -OR', 
=0, =NR\ =N-OR\ -NR'R", -SR', -halogen, -SiR'R"R"\ -OC(0)R', -C(0)R', -C0 2 R\ - 
CONR'R", -OC(0)NR'R", -NR"C(0)R', -NR'-C(0)NR"R"\ -NR"C(0) 2 R', -NH- 
C(NH 2 )=NH, -NR'C(NH 2 )=NH, -NH-C(NH 2 )=NR\ -S(0)R\ -S(0) 2 R', -S(0) 2 NR'R", -CN 
and -N0 2 in a number ranging from zero to four, preferably, zero, one, two or three 
substituents. R', R" and R'" each independently refer to hydrogen, unsubstituted (C r 
C 8 )alkyl and heteroalkyl, unsubstituted aryl, aryl substituted with 1-3 halogens, unsubstituted 
alkyl, alkoxy or thioalkoxy groups, or aryl-(Ci-C 4 )alkyl groups. When R' and R" are 
attached to the same nitrogen atom, they can be combined with the nitrogen atom to form a 5- 
, 6-, or 7-membered ring. For example, -NR'R" is meant to include 1-pyrrolidinyl and 4- 
morpholinyl. From the above discussion of substituents, one of skill in the art will 
understand that the term "alkyl" in its broadest sense is meant to include groups such as 
haloalkyl (e.g., -CF 3 and -CH 2 CF 3 ) and acyl (e.g., -C(0)CH 3 , -C(0)CF 3 , -C(0)CH 2 OCH 3 , 
and the like). Preferably, the alkyl groups will have from 0-3 substituents, more preferably 0, 
1, or 2 substituents, unless otherwise specified. 

Similarly, substituents for the aryl groups are varied and are selected from: - 
halogen, -OR', -OC(0)R, -NR'R", -SR', -R', -CN, -N0 2 , -C0 2 R\ -CONR'R", -C(0)R', - 
OC(0)NR'R", -NR"C(0)R', -NR"C(0) 2 R', ,-NR'-C(0)NR"R", -NH-C(NH 2 )=NH, - 
NR'C(NH 2 )=NH, -NH-C(NH 2 )=NR', -S(0)R', -S(0) 2 R\ -S(0) 2 NR'R", -N 3 , -CH(Ph) 2 , 
perfluoro(C 1 -C 4 )alkoxy, and perfluoroCC-C^alkyl, in a number ranging from zero to the 
total number of open valences on the aromatic ring system; and where R', R" and R'" are 
independently selected from hydrogen, (C,-C 8 )alkyl and heteroalkyl, unsubstituted aryl and 
heteroaryl, (unsubstituted aryl)-(C,-C4)alkyl, and (unsubstituted aryl)oxy-(C,-C 4 )alkyl. 

Two of the substituents on adjacent atoms of the aryl or heteroaryl ring may 
optionally be replaced with a substituent of the formula -T-C(0)-(CH 2 ) q -U-, wherein T and U 
are independently -NH-, -0-, -CH 2 - or a single bond, and q is an integer of from 0 to 2. 
Alternatively, two of the substituents on adjacent atoms of the aryl or heteroaryl ring may 
optionally be replaced with a substituent of the formula -A-(CH 2 ) r -B-, wherein A and B are 
independently -CH 2 -, -0-, -NH-, -S-, -S(O)-, -S(0) 2 -, -S(0) 2 NR'- or a single bond, and r is an 
integer of from 1 to 3. One of the single bonds of the new ring so formed may optionally be 
replaced with a double bond. Alternatively, two of the substituents on adjacent atoms of the 
aryl or heteroaryl ring may optionally be replaced with a substituent of the formula -(CH 2 ) S - 
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X-(CH 2 )r, where s and t are independently integers of from 0 to 3, and X is -O-, -NR'-, -S-, - 
S(O)-, -S(0) 2 -, or -S(0) 2 NR'-. The substituent R' in -NR'- and -S(0) 2 NR'- is selected from 
hydrogen or unsubstituted (Ci-C6)alkyl. 

Certain compounds or oligonucleotides of the present invention may exist in a 

5 salt form. Such salts include base addition salts such as sodium, potassium, calcium, 

ammonium, organic amino, or magnesium salt, or a similar salt. When the compounds or 
modified oligonucleotides of the present invention contain relatively basic functionalities, 
acid addition salts can be obtained by contacting the neutral form of such compounds with a 
sufficient amount of the desired acid, either neat or in a suitable inert solvent. Examples of 

10 acceptable acid addition salts include those derived from inorganic acids like hydrochloric, 
hydrobromic, nitric, carbonic, monohydrogencarbonic, phosphoric, 
monohydrogenphosphoric, dihydrogenphosphoric, sulfuric, monohydrogensulfuric, 
hydriodic, or phosphorous acids and the like, as well as the salts derived from organic acids 
like acetic, propionic, isobutyric, maleic, malonic, lactic, benzoic, succinic, suberic, fumaric, 

1 5 ' mandelic, phthalic, benzenesulfonic, p-tolylsulfonic, citric, tartaric, methanesulfonic, and the 
like. Also included are salts of amino acids such as arginate and the like, and salts of organic 
acids like glucuronic or galactunoric acids and the like (see, for example, Berge, S.M., et al, 
"Pharmaceutical Salts", Journal of Pharmaceutical Science, 1977, 66, 1-19). Certain specific 
compounds of the present invention contain both basic and acidic functionalities that allow 

20 the compounds to be converted into either base or acid addition salts. 

The neutral forms of the compounds may be regenerated by contacting the salt 
with a base or acid and isolating the parent compound in the conventional manner. The 
parent form of the compound differs from the various salt forms in certain physical 
properties, such as solubility in polar solvents, but otherwise the salts are equivalent to the 

25 parent form of the compound for the purposes of the present invention. 

Certain compounds of the present invention can exist in unsolvated forms as 
well as solvated forms, including hydrated forms. In general, the solvated forms are 
equivalent to unsolvated forms and are intended to be encompassed within the scope of the 
present invention. Certain compounds of the present invention may exist in multiple 

30 crystalline or amorphous forms. In general, all physical forms are equivalent for the uses 

contemplated by the present invention and are intended to be within the scope of the present 
invention. 

Certain compounds of the present invention possess asymmetric carbon atoms 
(optical centers) or double bonds; the racemates, diastereomers, geometric isomers and 
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individual isomers are all intended to be encompassed within the scope of the present 
invention. The methods for the determination of stereochemistry and the separation of 
isomers are well-known in the art (see discussion in Chapter 4 of "Advanced Organic 
Chemistry", 4th edition J. March, John Wiley and Sons, New York, 1 992). 

The compounds of the present invention may also contain unnatural 
proportions of atomic isotopes at one or more of the atoms that constitute such compounds. 
For example, the compounds may be radiolabeled with radioactive isotopes, such as for 
example tritium ( 3 H), iodine-125 ( 125 I) or carbon-14 ( 14 C). All isotopic variations of the 
compounds of the present invention, whether radioactive or not (e.g, 2 H), are intended to be 
encompassed within the scope of the present invention. 

"Protecting group" or "protected form thereof refers to a grouping of atoms 
that when attached to a reactive group in a molecule masks, reduces or prevents that 
reactivity. Examples of protecting groups can be found in T.W. Greene and P.G. Futs, 
Protective Groups in Organic Chemistry, (Wiley, 2nd ed. 1991), Beaucage and Iyer, 
Tetrahedron 48:2223-23 1 1 (1992), and Harrison and Harrison et al, Compendium of 
Synthetic Organic Methods, Vols. 1-8 (John Wiley and Sons. 1971-1996). Representative 
amino protecting groups include formyl, acetyl, trifluoroacetyl, benzyl, benzyloxycarbonyl 
(CBZ), terr-butoxycarbonyl (Boc), trimethyl silyl (TMS), 2-trimethylsilyl-ethanesulfonyl 
(SES), trityl and substituted trityl groups, allyloxycarbonyl, 9-fluorenylmethyloxycarbonyl 
(FMOC), nitro-veratryloxycarbonyl (NVOC) and the like. Representative hydroxy protecting 
groups include those where the hydroxy group is either acylated or alkylated such as benzyl 
and trityl ethers as well as alkyl ethers, tetrahydropyranyl ethers, trialkylsilyl ethers and allyl 
ethers. 

"Optional" or "optionally" in the above definitions means that the 
subsequently described event or circumstance may but need not occur, and that the 
description includes instances where the event or circumstance occurs and instances in which 
it does not. For example, "heterocyclo group optionally mono- or di- substituted with an 
alkyl group" means that the alkyl may but need not be present, and the description includes 
situations where the heterocyclo group is mono- or disubstituted with an alkyl group and 
situations where the heterocyclo group is not substituted with the alkyl group. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques in organic chemistry, biochemistry, oligonucleotide synthesis and 
modification, bioconjugate chemistry, nucleic acid hybridization, molecular biology, 
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microbiology, genetics, recombinant DNA, and related fields as are within the skill of the art. 
These techniques are fully explained in the literature. See, for example, Maniatis, Fritsch & 
Sambrook, MOLECULAR CLONING: A LABORATORY MANUAL, Cold Spring Harbor 
Laboratory Press (1982); Sambrook, Fritsch & Maniatis, MOLECULAR CLONING: A 
LABORATORY MANUAL, Second Edition, Cold Spring Harbor Laboratory Press (1989); 
Ausubel, et al, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & 
Sons (1987, 1988, 1989, 1990, 1991, 1992, 1993, 1994, 1995, 1996); Gait (ed.), 
OLIGONUCLEOTIDE SYNTHESIS: A PRACTICAL APPROACH, IRL Press (1984); 
Eckstein (ed.), OLIGONUCLEOTIDES AND ANALOGUES: A PRACTICAL 
APPROACH, IRL Press (1991). 

General 

Oligonucleotides are short polymers of nucleotides, generally less than 200 
nucleotides, preferably less than 150 nucleotides, more preferably less than 100 nucleotides, 
more preferably less than 50 nucleotides and most preferably less than 21 nucleotides in 
length. Polynucleotides are generally considered, in the art, to comprise longer polymers of 
nucleotides than do oligonucleotides, although there is an art- recognized overlap between the 
upper limit of oligonucleotide length and the lower limit of polynucleotide length. With 
respect to the present invention, "oligonucleotide" generally refers to a nucleic acid, usually 
comprising a detectable label, that is used as a probe or as a primer; while polynucleotide 
refers to a nucleic acid containing a target sequence. Consequently, for the purposes of the 
present invention, the terms "oligonucleotide" and "polynucleotide" shall not be considered 
limiting with respect to polymer length. 

The present invention provides modified oligonucleotides having new and 
surprising properties of superior mismatch discrimination, compared to unmodified 
oligonucleotides. Modified oligonucleotides of the invention are used as probes, wherein 
their hybridization to a target sequence is detected, or as primers, wherein their hybridization 
to a target sequence is followed by polynucleotide synthesis initiated from the 3' terminus of 
the modified oligonucleotide, and the synthesized product {i.e., the extension product) is 
detected. 

A target sequence refers to a nucleotide sequence which comprises a site of 
hybridization for a probe or a primer. Target sequences can be found in any nucleic acid 
including, but not limited to, genomic DNA, cDNA, RNA and any amplified product thereof, 
and can comprise a wild-type gene sequence, a mutant gene sequence, a non-coding 
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sequence, a regulatory sequence, etc. A target sequence will generally be less than 100 
nucleotides, preferably less than 50 nucleotides, and most preferably, less than 21 nucleotides 
in length. 

5 Description of the Embodiments 

The present invention provides a number of modified oligonucleotides which 

can generally be divided into three groups. 

The first group of modified oligonucleotides are those having at least two 

10 modified bases replacing the naturally-occurring bases. Here, the modified bases will be 

unsubstituted or 3-substituted pyrazolo[3,4-d]pyrimidines. In some embodiments, however, 
the modified bases will be selected such that at least one of the bases is a 5-substituted 
pyrimidine and at least one of the bases is an unsubstituted or a 3-substituted pyrazolo[3,4- 
d]pyrimidine. Preferably, this group of modified oligonucleotides will have additional 

15 attached groups (e.g., minor groove binders, reporter groups, quenchers, etc.) which assist 
during assays to detect target sequences. 

The second group of modified oligonucleotides are those having at least one 
modified base, but also having an attached minor groove binder, reporter group, quencher, or 
the like. 

20 The third group of modified oligonucleotides are those that comprise one or 

more of the novel modified bases described below. As in the first group, this group of 
modified oligonucleotides will preferably have attached groups selected from, for example, 
minor groove binders, reporter groups or quenchers. 

25 Modified Oligonucleotides 

In one aspect, the present invention provides modified oligonucleotides 
comprising at least two bases selected from the group consisting of unsubstituted and 3- 
substituted pyrazolo[3,4-d]pyrimidine bases. 
30 The pyrazolo[3,4-d]pyrimidine bases are those bases in which a pyrazole ring 

is fused to a pyrimidine ring in the following specific orientation: 
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X 2 




la 

in which the wavy line indicates the point of attachment between the base and either an 
attached oligomer-forming sugar or an amino acid involved in peptide nucleic acid formation. 
5 Additionally, the groups X 1 and X 2 are independently H, OH or NH 2 such that the 

pyrazolo[3,4-d]pyrimidine bases approximate the construction of the natural purine bases, 
guanosine, adenine and inosine, as well as related derivatives of those bases. In this group of 
embodiments, an "unsubstituted pyrazolo[3,4-d]pyrimidine base" refers to those bases of the 
general formula la, while the term "3 -substituted pyrazolo[3,4-d]pyrimidine base" refers to 
10 those bases having the formula lb, in which R 1 is defined as described below. 




Ib 

In formula Ib, the symbols X 1 and X 2 independently represent H, OH, NH 2 or 
a protected form thereof. The symbol R 1 represents a member selected from (Ci- 

15 Ci 2 )heteroalkyl, (C 2 -C, 2 )heteroalkenyl, (C 2 -C, 2 )heteroalkynyl, -0-(C r Ci 2 )alkyl, -O-(0 2 - 
C, 2 )alkenyl, -0-(C 2 -C 12 )alkynyl, -S-(C,-C 12 )alkyl, -S-(C 2 -C l2 )alkenyl, -S-(C 2 -C, 2 )alkynyl, 
heterocyclyl(C r Ci 2 )alkyl, heterocyclyl(C 2 -C 12 )alkenyl, heterocyclyl(C 2 -Ci 2 )alkynyl, aryl(Ci- 
C 12 )alkyl, aryl(C 2 -C, 2 )alkenyl, aryl(C 2 -Ci 2 )alkynyl, aryl, heterocyclyl, halogen, -CN, 
-CONH 2 and protected forms thereof. Preferred heteroalkyl, heteroalkenyl and heteroalkynyl 

20 groups are those that terminate (distal to the pyrazolo[3,4-d]pyrimidine ring system) in a 
heteroatom group (e.g., OH, NH 2 , SH and the like). For example, preferred heteroalkyl, 
heteroalkenyl and heteroalkynyl groups include 3-amino-l -propyl, 4-hydroxy-l -butyl, 3- 
amino- 1 -propyn- 1 -yl, 3-hydroxy- 1 -propyn- 1 -yl, 4-hydroxy-3-hydroxymethyl- 1 -butyn- 1 -yl, 
4-hydroxy-l -butyn-l-yl, and their higher homologs. Other preferred R 1 groups include those 

25 which terminate in an aryl or heterocyclic group (e.g., heterocyclyl(Ci-C, 2 )alkyl, 

heterocyclyl(C 2 -C (2 )alkenyl, heterocyclyl(C 2 -C, 2 )alkynyl, aryl(C r C, 2 )alkyl, aryl(C 2 - 
C t2 )alkenyl arid aryl(C 2 -C, 2 )alkynyl). Preferred aryl and heterocyclic groups are either 
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substituted or unsubstituted phenyl, thienyl, thiazolyl, imidazolyl, furanyl, oxazolyl, 
pyridinyl, pyrrolyl, indolyl, benzimidazolyl, benzthiazolyl and benzoxazolyl. 

In particularly preferred embodiments, the pyrazolo[3,4-d]pyrimidine base is 

selected from: 

O R i NH 2 R 1 ^ H 2 R 1 



.A 





HN T II N and 



TO 



H 2 N' "N' N ' H 2 N^N^N N If 



Ic Id Ie 

Still more preferably, the modified bases are selected from Ic, Id or Ie wherein 
each R 1 is a propynyl, hydroxypropynyl, aminopropynyl, aminobutynyl, hydroxybutynyl, or a 
substituted or unsubstituted phenyl, thienyl, thiazolyl, imidazolyl, furanyl, oxazolyl, 

10 pyridinyl, pyrrolyl, indolyl, benzimidazolyl, benzthiazolyl or benzoxazolyl group attached 
directly to the pyrazolo[3,4-d]pyrimidine ring system or attached to the ring system via a one 
to four carbon linking group which can be saturated (e.g., ethylene, propylene, butylene) or 
unsaturated (e.g., acetylenyl, propynylene, butynylene, propenylene, butenylene). 

The modified oligonucleotides of the present invention have a backbone such 

1 5 as, for example, that found with the naturally-occurring oligonucleotides or peptide-nucleic 
acids (e.g., heterocyclic bases attached to oligomer-forming sugars or peptide-nucleic acid- 
forming amino acids). Alternative oligomeric backbones are known to those of skill in the art 
(referred to below as "modified" backbones or comprising backbone modifications). For the 
purpose of this invention a modified oligonucleotide is any oligomeric polymer, including the 

20 natural phosphate backbone and non-natural backbones, that contains natural and/or modified 
bases and retains the ability to hybridize specifically to a complementary nucleic acid to form 
a stable duplex. 

In preferred embodiments, the modified oligonucleotides of the invention have 
a backbone of sugar or glycosidic moieties, preferably 2-deoxyribofuranosides wherein all 

25 intemucleotide linkages are the naturally occurring phosphodiester linkages. In alternative 
embodiments however, the 2-deoxy-P-D-ribofuranose groups are replaced with other sugars, 
for example, p-D-ribofuranose. In addition, p-D-ribofuranose may be present wherein the 2- 
OH of the ribose moiety is alkylated with a C,. 6 alkyl group (2-(0-C,-6 alkyl) ribose) or with 
a C 2 -6 alkenyl group (2-(0-C 2 . 6 alkenyl) ribose), or is replaced by a fluoro group (2- 

30 fluororibose). Related oligomer-forming sugars useful in the present invention are those that 
are "locked", i.e., contain a methylene bridge between C-4' and an oxygen atom at C-2\ 
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Other sugar moieties compatible with hybridization of the oligonucleotide can also be used, 
and are known to those of skill in the art, including, but not limited to, a-D- 
arabinofuranosides, a-2'-deoxyribofuranosides or 2',3'-dideoxy-3'-aminoribofuranosides. 
Oligonucleotides containing a-D-arabinofuranosides can be prepared as described in U.S. 
Patent No. 5,177,196. Oligonucleotides containing 2\3'-dideoxy-3 , -aminoribofuranosides are 
described in Chen et al. ((1995) Nucleic Acids Res. 23:2661-2668). Synthetic procedures for 
locked nucleic acids (Singh et al, Chem. Comm., 455-456 (1998); Wengel J., Acc. Chem. 
Res., 32:301-310 (1998)) and oligonucleotides containing 2'-halogen-2'- 
deoxyribofuranosides (Palissa et al., Z. Chem.. 27:216 (1987)) have been described.. The 
phosphate backbone of the modified oligonucleotides described herein can also be modified 
so that the oligonucleotides contain phosphorothioate linkages and/or methylphosphonates 
and/or phosphoroamidates (Chen et al., Nucl. Acids Res., 23:2662-2668 (1995)). 
Combinations of oligonucleotide linkages are also within the scope of the present invention. 
Still other backbone modifications are known to those of skill in the art. 

In another group of embodiments, the modified bases described herein are 
incorporated into PNA and DNA/PNA chimeras to balance T m s and provide modified 
oligonucleotides having improved mismatch discrimination. Various modified forms of 
DNA and DNA analogues have been used in attempts to overcome some of the disadvantages 
of the use of DNA molecules as probes and primers. Among these are peptide nucleic acids 
(PNAs, also known as polyamide nucleic acids). Nielsen et al. (1991) Science 254:1497- 
1500. PNAs contain heterocyclic base units, as found in DNA and RNA, that are linked by a 
polyamide backbone, instead of the sugar-phosphate backbone characteristic of DNA and 
RNA. PNAs are capable of hybridization to complementary DNA and RNA target sequences 
and, in fact, hybridize more strongly than a corresponding nucleic acid probe. The synthesis 
of PNA oligomers and reactive monomers used in the synthesis of PNA oligomers have been 
described in U.S. Patents No. 5,539,082; 5,714,331; 5,773,571; 5,736,336 and 5,766,855. 
Alternate approaches to PNA and DNA/PNA chimera synthesis and monomers for PNA 
synthesis have been summarized. Uhlmann et al. (1998) Angew. Chem. Int. Ed. 37:2796- 
2823. Accordingly, the use of any combination of normal bases, unsubstituted pyrazolo[3,4- 
d]pyrimidine bases (e.g., PPG and PPA), 3-substituted pyrazolo[3,4-d]pyrimidines, modified 
purine, modified pyrimidine, 5-substituted pyrimidines, universal bases or a minor groove 
binder to balance the T m of a PNA or DNA/PNA chimera is in the scope of this invention. A 
variety of universal bases are known in the art. Other universal bases have recently been 
described and are alos useful in the present invention (see, Seela, et al., XIV International 
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Round Table: Nucleosides, Nucleotides and Their Biological Applications, September 10-14, 
2000, San Francisco California, pp. 40). The synthetic methods necessary for the synthesis of 
modified base monomeric units required for PNA and PNA/DNA chimeras synthesis are 
available in the art, see methods in this application and Uhlmann et al. Angew. Chem. Int. Ed. 
5 37:2796-2823 (1998). 

Similarly, the present invention demonstrates a combination of normal bases, 
unsubstituted pyrazolo[3,4-d]pyrimidine bases (e.g., PPG and PPA), 3-substituted 
pyrazolo[3,4-d]pyrimidines, modified purine, modified pyrimidine, 5-substituted 
pyrimidines, universal bases or a minor groove binder can be used to balance the T m of any 

1 0 oligonucleotide polymer or oligopolymer/DNA chimera. 

For the uses described herein, the modified oligonucleotides will preferably 
have from 4 to 70 bases, more preferably from 4 to 25 bases. In some embodiments, the 
modified oligonucleotides will have 15 or fewer, or more preferably 10 or fewer bases. 
Additionally, the modified oligonucleotides will, in some embodiments, have 3, 4, 5, 6, 7, 8, 

15 or 9 or more modified bases (either unsubstituted or 3-substituted pyrazolo[3,4- 

d]pyrimidines). For each of the embodiments wherein multiple 3-substituted pyrazolo[3,4- 
d]pyrimidine bases are present, the modified bases can either be the same or different from 
each other. 

In addition to the modified base components, the modified oligonucleotides of 
20 the present invention will in some embodiments comprise additional pendant groups such as, 
for example, intercalators, lipophilic groups, minor groove binders, reporter groups, chelating 
agents, quenchers and cross-linking agents attached to one or more of the internally located 
nucleotide bases, to the 3', to the 5' end, to both ends, or can have such pendant groups 
attached both internally and at one or both ends. Methods suitable for attaching intercalator, 
25 lipophilic groups, minor grove binders, reporter groups, chelating agents, quenchers and 
cross-linking agents to oligonucleotides have been described in, for example, U.S. Patent 
Nos. 5,512,667 and 5,419,966, PCT publication WO 96/32496 and U.S. application Ser. No. 
09/457,616. The oligonucleotides of the invention can also have a relatively low molecular 
weight "tail moiety" attached either at the 3' or 5' end, or at both ends. By way of example a 
30 tail molecule can be a phosphate, a phosphate ester, an alkyl group, an aminoalkyl group, or a 
lipophilic group. The tail moiety can also link the intercalators, lipophilic groups, minor 
groove binders, reporter groups, chelating agents and cross-linking functionalities to the 
oligonucleotides of the invention. The nature of tail moieties and methods for obtaining 
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oligonucleotides with various tail moieties are also described in the above-referenced U.S. 
Patent Nos. 5,512,667 and 5,419,966. 

Minor Groove Binders 

In one group of embodiments, the modified oligonucleotide will also have a 
covalently attached minor groove binder (MGB). A variety of suitable minor groove binders 
have been described in the literature. See, for example, Kutyavin, et al. U.S. Patent No. 
5,801,155; Wemmer, D.E., and Dervan P.B., Current Opinon in Structural Biology, 7:355- 
361 (1997); Walker, W.L., Kopka, J.L. and Goodsell, D.S., Biopolymers, 44:323-334 (1997); 
Zimmer, C & Wahnert, U. Prog. Biophys. Molec. Bio. 47:31-1 12 (1986) and Reddy, B.S.P., 
Dondhi, S.M., and Lown, J. W., Pharmacol. Therap., 84:1-1 1 1 (1999). 

Suitable methods for attaching MGBs (as well as reporter groups such as 
fluorophores and quenchers described below) through linkers to oligonucleotides are 
described in, for example, U.S. Patent Nos. 5,512,677; 5,419,966; 5,696,251; 5,585,481; 

5,942,610 and 5,736,626. 

The MGB can be attached at either or both ends of the oligonucleotide. In 
addition or alternatively, one or more MGBs can be attached in the interior of the 
oligonucleotide, depending on the length of the oligonucleotide. In general, conjugation of a 
MGB to either end of an oligonucleotide would provide the greatest degree of hybrid 
stability, since melting of an oligonucleotide duplex begins at the termini. Nonetheless, if 
both ends of a duplex formed by an oligonucleotide are relatively stable, for example, due to 
a high G+C content, attachment of a MGB in the interior of an oligonucleotide (for instance, 
near an A+T-rich sequence) could also enhance stability. The intended use of the 
MGB-oligonucleotide conjugate may also place limitations on the location of the conjugated 
MGB. For example, if an oligonucleotide is designed to be used as a primer, the 3'-hydroxy 
group must be free and capable of being elongated by a polymerizing enzyme. Alternatively, 
an assay that requires an oligonucleotide possessing a labeled 5'-end would require internal 

or 3 '-end attachment of a MGB. 

The location of a MGB within a MGB-modified oligonucleotide conjugate can 
also affect the discriminatory properties of such a conjugate. An unpaired region within a 
duplex will result in changes in the shape of the minor groove in the vicinity of the mispaired 
base(s). Since MGBs fit best within the minor groove of a perfectly-matched DNA duplex, 
mismatches resulting in shape changes in the minor groove would reduce binding strength of 
a MGB to a region containing a mismatch. Hence, the ability of a MGB to stabilize such a 
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hybrid would be decreased, thereby increasing the ability of a MGB-oligonucleotide 
conjugate to discriminate a mismatch from a perfectly-matched duplex. On the other hand, if 
a mismatch lies outside of the region complementary to a MGB-oligonucleotide conjugate, 
discriminatory ability for unconjugated and MGB -conjugated oligonucleotides of equal 
length is expected to be approximately the same. Since the ability of an oligonucleotide 
probe to discriminate single base pair mismatches depends on its length, shorter 
oligonucleotides are more effective in discriminating mismatches. The primary advantage of 
the use of MGB-oligonucleotides conjugates in this context lies in the fact that much shorter 
oligonucleotides compared to those used in the pnor art {i.e., 20-mers or shorter), having 
greater discriminatory powers, can be used, due to the pronounced stabilizing effect of MGB 
conjugation. 

Preferred minor groove binders are those selected from the formulae: . 

/ ° 



CH 3 O CH 3 




R b - 



the subscript m is an integer of from 2 to 5; the subscript r is an integer of from 2 to 10; and 
1 5 each R a and R b is independently a linking group to the modified oligonucleotide, H, -OR c , 
-NR c R d , -COOR c or -CONR c R d , wherein each R c and R d is selected from H, (C,- 
C 12 )heteroalkyl, (C 2 -C 12 )heteroalkenyl, (C 2 -C 12 )heteroalkynyl, (C,-C l2 )alkyl, (C 2 - 
C 12 )alkenyl, (C 2 -C 12 )alkynyl, aryl(C,-C 12 )alkyl and aryl. 

Particularly preferred minor groove binders include the trimer of 3-carbamoyl- 
20 l,2-dihydro-(3^-pyrrolo[3,2-e]indole-7-carboxylate (CDPI 3 ), the pentamer of N- 

methylpyrrole-4-carbox-2-amide (MPC 5 ) and other minor groove binders that exhibit 
increased mismatch discrimination. Additional MGB moieties that will find use in the 
practice of the present invention are disclosed in co-owned U.S. Patent No. 5,801,155. In 
certain embodiments, the MGBs can have attached water solubility-enhancing groups (e.g., 
25 sugars or amino acids). 

Reporter Groups 

In another group of embodiments, the modified oligonucleotide will further 
compnse at least one cova.ently attached reporter group. The reporter groups can be attached 
30 using methods and linking groups described above for the MGBs. Suitable reporter groups 
for the present modified oligonucleotides include beads, nanoparticles (Taton, T.A. et al, 
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Science 289:1757-1760 (2000)), chemiluminescers, isotopes, enzymes and fluorophores. 
Preferably, the reporter group is a fluorophore (see, Haugland, R.P., HANDBOOK OF 
Fluorescent Probes and Research Chemicals, Sixth Edition, Molecular Probes, Eugene, 
OR, 1996). Suitable fluorophores include the resorufin dyes, coumarin dyes, rhodamine 
dyes, cyanine dyes, BODIPY dyes and pyrenes. 



Quenchers 



Recently developed detection methods employ the process of fluorescence 
resonance energy transfer (FRET) for the detection of probe hybridization rather than direct 
detection of fluorescence intensity. In this type of assay, FRET occurs between a donor 
fluorophore (reporter) and an acceptor molecule (quencher) when the absorption spectrum of 
the quencher molecule overlaps with the emission spectrum of the donor fluorophore and the 
two molecules are in close proximity. The excited-state energy of the donor fluorophore .. 
transferred to the neighboring acceptor by a resonance dipole-induced dxpole interaction, 
which results in quenching of the donor fluorescence. If the acceptor molecule is a 
fluorophore, its fluorescence may sometimes be increased. The efficiency of the energy 
transfer between the donor and acceptor molecules is highly dependent on distance between 
the molecules. Equafions descnbing this relationship are known. The Forster distance (R,) is 
described as the distance between the donor and acceptor molecules where the energy 
transfer is 50% efficient. Other mechanisms of fluorescence quenching are also known, such 
as collisions and charge transfer quenching. There is extensive guidance in the art for 
selecting quencher and fluor pairs and their attachment to oligonucleotides (Haugland, R.P., 
Handbook of Fluorescent Probes and Research Chemicals, Sixth Edition, Molecular 
Probes, Eugene, OR, 1996; U.S. Patent No, 3,996,345 and 4,351,760 and the like). 

Preferred quenchers are described in co-owned U.S. Ser. No. 09/457,616 (filed 

December 8, 1999). 

Fluorophores and Quenchers 

In certain embodiments of the present invention, oligonucleotides comprising 
fluorescent labels (fluorophores) and/or fluorescence quenching agents are used. In a 
preferred embodiment, an oligonucleotide contains both a fluorophore and a quenching agent. 
Fluorescent labels include, but are not limited to, fluoresceins, rhodamines, cyanmes, 
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phycoerythrins, and other fluorophores as described herein. Still other suitable fluorophores 
are known to those of skill in the art. As noted above, quenching agents or quenchers can 
absorb energy emitted by a fluorophore so as to reduce the amount of fluorescence emitted 
(i.e., quench the emission of the fluorescent label). Different fluorophores are quenched by 
different quenching agents. In general, the spectral properties of a particular 
fluorophore/quenching agent pair are such that one or more absorption wavelengths of the 
quencher overlaps one or more of the emission wavelengths of the fluorophore. A preferred 
fluorophore/quencher pair can be selected by one of skill in the art by comparison of 
emission and excitation wavelengths according to the properties set forth above. 
) For use in amplification assays conducted at elevated temperatures, such as a 

polymerase chain reaction, or other procedures utilizing thermostable enzymes, the label is 
preferably one that is stable at elevated temperatures. For assays involving polymerization, 
the label is one that does not interfere with the activity of the polymerizing enzyme. 
Additionally, the label will be present at the 5' and/or 3' end of the oligonucleotide, and/or 
15 can also be present internally in a non-interfering position. Accordingly, the label can be 
attached to any of the bases, sugars or phosphate moieties of the oligonucleotide, or to any 
linking group that is itself attached to one of these moieties. 

While the invention has been described in terms of the various bases and other 
20 optional components, the overall structure of the modified oligonucleotides can also be 
expressed in a formula to indicate a desired construction. 

Thus, in one group of embodiments, the modified oligonucleotides have the 

formula: 

B 

25 wherein R 2 and R 3 represent the termini of the modified oligonucleotide; the subscript n is an 
integer of from 4 to 70, more preferably 4 to 25 and still more preferably 4 to 10; each B is a 
member independently selected from adenine, thymine, cytosine, guanine, uracil, an 
unsubstituted P yrazolo[3,4-d] P yrimidine and a 3-substituted P yrazolo[3,4-d]pyrimidine; and 
each M is selected from an oligomer-forming sugar and a peptide-nucleic acid-forming ammo 

30 acid, with the proviso that at least two of the Bs are selected from unsubstituted pyrazolo[3,4- 
d]pyrimidines and a 3-substituted pyrazolo[3,4-d]pyrimidines. In certain specific 
embodiments, R 2 and R 3 represent the 5'- and 3'-ends of a modified oligonucleotides wherein 
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M is an oligomer-forming sugar (e.g., 2-deoxy-P-D-ribofuranose, p-D-ribofuranose, a-D- 
arabinofuranosides, a-Z-deoxyribofuranosides, 2' ) 3 , -dideoxy-3 , -aminoribofuranosides, and 
locked sugars). For those embodiments in which the modified oligonucleotides have attached 
minor groove binders, reporter groups, etc., each of the R 2 and R 3 groups are meant to include 
functional groups suitable for attachment of linking groups and the additional functional 
component (MGB, fluorophore, quencher, and the like). Such functional groups include, for 
example, hydroxy groups, amino groups, carboxylic acid or ester groups, phosphoric, 
phosphonic, or phosphinic acid or ester groups, sulfonic acid and ester groups, and the like. 
In other embodiments, MGBs, reporter groups and the like are attached to any of the interior 
base/backbone groups using conventional methodology. 

In a related aspect, the present invention provides modified oligonucleotides 
comprising at least one 5-substituted pyrimidine base and at least one unsubstituted or 3- 
substituted pyrazolo[3,4-d]pyrimidine base. The unsubstituted or 3-substituted pyrazolo[3,4- 
d]pyrimidine bases useful in this aspect of the invention are essentially those that have been 
described above (see formulae la and lb). A variety of 5-substituted pyrimidine bases are 
useful including those described in U.S. Patent Nos. 5,645,985 and 5,484,908. 

In preferred embodiments, the 5 -substituted pyrimidine base has the formula: 




Ila 

and the unsubstituted or 3-substituted pyrazolo[3,4-d]pyrimidine base has the formula: 



X 2 R 1 




lb 

wherein each of the X 1 , X 2 and X 3 groups is independently selected from H, OH, NH 2 and a 
protected amino group; and each of the R 1 and R 4 groups is independently selected from (C,- 
C 12 )heteroalkyl, (C 2 -C 12 )heteroalkenyl, (C 2 -C 12 )heteroalkynyl, -0-(C,-C, 2 )alkyl, -0-(C 2 - 
C 12 )alkenyl, -0-(C 2 -C 12 )alkynyl, -S-(C,-C 12 )alkyl, -S-(C 2 -C 12 )alkenyl, -S-(C 2 -C 12 )alkynyl, 
heterocyclyl(C,-C, 2 )alkyl, heterocyclyl(C 2 -C 12 )alkenyl, heterocyclyl(C 2 -C 12 )alkynyl, aryl(C,- 
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Ci 2 )alkyl, aryl(C 2 -Ci 2 )alkenyl, aryl(C 2 -Ci 2 )alkynyl, aryl, heterocyclyl, halogen, -CN, 
-CONH 2 and protected forms thereof, with the additional feature that R 1 can also be H. 

In particularly preferred embodiments, the 5-substituted pyrimidine base is 

selected from: 




lib He 
wherein R 4 is preferably selected from propynyl, hydroxypropynyl, aminopropynyl, 
hydroxybutynyl or a substituted or unsubstituted phenyl, thienyl, thiazolyl, imidazolyl, 
furanyl, oxazolyl, pyridinyl, pyrrolyl, indolyl, benzimidazolyl, benzthiazolyl or benzoxazolyl 
10 group attached directly to the pyrimidine ring or attached to the ring via a one to four carbon 
linking group which can be saturated (e.g., ethylene, propylene, butylene) or unsaturated 
(e.g., acetylenyl, propynylene, butynylene, propenylene, butenylene). 

In other preferred embodiments, the pyrazolo[3,4-d]pyrimidine base is 
selected from the preferred bases provided above (Ic, Id and Ie). Additionally, suitable bases 
15 are also depicted in Figure 1. 

As with the earlier aspect of the invention wherein the modified 
oligonucleotides comprise at least two unsubstituted or 3-substituted pyrazolo[3,4- 
d]pyrimidines, this aspect of the invention can similarly comprise additional groups such as 
MGBs and reporter groups (e.g., fluorophores, quenchers and the like) as well as linking 
20 groups suitable for the attachment of these additional components. 

Also preferred are those embodiments in which the modified oligonucleotide 
is represented by the formula: 



B 




wherein R 2 represents a first end of the modified oligonucleotide; R 3 represents a second end 
25 of the modified oligonucleotide; the subscript n is an integer of from 4 to 70; each B is 
independently selected from adenine, thymine, cytosine, guanine, uracil, a 5-substituted 
pyrimidine, an unsubstituted pyrazolo[3,4-d]pyrimidine and a 3-substituted pyrazolo[3,4- 
djpyrimidine (with the proviso that at least one base is a 5-substituted pyrimidine and at least 
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one base is a pyrazolo[3,4-d]pyrimidine); and each M is an oligomer-forming sugar or a 
peptide-nucleic acid-forming amino acid. 



In yet another aspect of the invention, modified oligonucleotides are provided 
which comprise an attached minor groove binder and in which as few as one of the bases is 
replaced with a modified base. Surprisingly, the combination of MGBs and a single modified 
base (or optionally multiple modified bases) leads to modified oligonucleotides having 
particularly useful properties for mismatch discrimination as well as primer extesions and 
other utilities described below. This group of modified oligonucleotides preferably comprise 
from 4 to 70 bases, and an attached minor groove binder, wherein at least one of the bases is 
replaced by a modified base selected from the group consisting of 5-substituted pyrimidines 
and unsubstituted or 3-substituted pyrazolo[3,4-d]pyrimidines. 

In this aspect of the invention, the minor groove binders, 5-substituted 
pyrimidines and unsubstituted or 3-substituted pyrazolo[3,4-d]pyrimidines can be essentially 
any of those components described above. 

In one group of embodiments, the modified oligonucleotide has at least one 5- 
substituted pyrimidine, preferably having the formula: 



X 3 



XJ 



R 4 



O N 
I 



Ila 

wherein X 3 is selected from H, NH 2 , OH and SH; and R 4 is selected from (Q- 
C (2 )heteroalkyl, (C 2 -Ci 2 )heteroalkenyl, (C 2 -Ci 2 )heteroalkynyl, -0-(Ci-Ci 2 )alkyl, -0-(C 2 - 
C 12 )alkenyl, -0-(C 2 -C 12 )alkynyl, -S-(C,-C 12 )alkyl, -S-(C 2 -C 12 )alkenyl, -S-(C 2 -C I2 )alkynyl, 
heterocyclyl(C,-C 12 )alkyl, heterocyclyl(C 2 -C, 2 )alkenyl, heterocyclyl(C 2 -C, 2 )alkynyl, aryl(C- 
C, 2 )alkyl, aryl(C 2 -C l2 )alkenyl, aryl(C 2 -C, 2 )alkynyl, aryl, heterocyclyl, halogen, -CN, 
-CONH 2 and protected forms thereof. In this group of embodiments, preferred aryl and 
heterocyclyl group (including those components of other groups, e.g., arylalkyl) are selected 
from substituted and unsubstituted versions of phenyl, tolyl, pyridyl, thiazolyl, imidazolyl, 
furanyl, oxazolyl, thienyl, pyrrolyl, benzimidazolyl, benzoxazolyl, benzthiazolyl, indolyl, 
triazinyl, pyrimidinyl and naphthyl. Suitable substituents for these aryl and heterocyclyl 
groups are those provided in the general definitions above. 



More preferably, the modified oligonucleotides comprise at least one 5- 
substituted pyrimidine based group of Formula lid and He, where the R 4 substituent group is 
selected from a substituted or unsubstituted alkyl group or alkynyl group. Preferably, R 4 is a 
hydroxypropynyl, hydroxybutynyl, aminopropynyl, aminobutynyl or propynyl group, as well 
as the higher homologs (e.g., C 5 -C 8 ) of these alkynyl groups. Alternatively, the R 4 
substituent can be a 3-(hydroxymethyl)-4-hydroxy-l-butynyl group (described in more detail 

below). 



NH 2 

R 4 -A .R" 

HN 





N |i •y 

O O^hT O O^N 

o=p-\ o=p-| 
o- o- 

Ild He 




In another group of preferred embodiments, the modified oligonucleotide has 
at least one modified base having the formula: 



1 R1 




N 



lb 



wherein each of the X 1 and X 2 groups is independently selected from H, OH, NH 2 and a 
protected amino group; and R 1 is selected from H, (C-C.^heteroalkyl, (C 2 - 
C 12 )heteroalkenyl, (C 2 -C 12 )heteroalkynyl, -0-(C,-C 12 )alk y l, -0-(C 2 -C 12 )alkenyl, -0-(C 2 - 
C I2 )alkynyl, -S-(C,-C 12 )alkyl, -S-(C 2 -C 12 )alkenyl, -S-(C 2 -C 12 )alkynyl, heterocyclyl(C,- 
C 12 )alkyl, heterocyclyl(C 2 -C 12 )alkenyl, heterocyclyl(C 2 -C 12 )alkynyl, aryKC-C^alkyl, 
aryl(C 2 -C l2 )alkenyl, aryl(C 2 -C 12 )alkynyl, aryl, heterocyclyl, halogen, -CN, -CONH 2 and 
protected forms thereof. In this group of embodiments, preferred aryl and heterocyclyl group 
(including those components of other groups, e.g., arylalkyl, heterocyclylalkyl) are selected 
from substituted and unsubstituted versus of phenyl, tolyl, pyridyl, thiazolyl, imidazolyl, 
furanyl, oxazolyl, thienyl, pyrrolyl, benzimidazolyl, benzoxazolyl, benzthiazolyl, indolyl, 
triazinyl, pyrimidinyl and naphthyl. Suitable substituents for these aryl and heterocyclyl 
groups are those provided in the general definitions above. 
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In particularly preferred embodiments, the modified oligonucleotides comprise 
at least one monomer of formula If, 




5 ^ 

wherein X 1 and X 2 are independently H, OH or NH 2 ; and R 1 is most preferably 3- 
hydroxypropyn-l-yl, propynyl, 3-aminopropyn-l-yl, 4-hydroxy-l-butynyl, 3- 
(hydroxymethyl)-4-hydroxy-l-butynyl, halogen or 3,3,3-trifluoropropyn-l-yl. Syntheses of 
some of these monomers have been reported (Balow et al., Nuc. Acid Res., 26:3350-3357 

10 (1998); Seela et al., J. Chem. Soc. Perkin Trans., I, 479-488 (1999); Ramzaeva et al., Helv. 
Chim. Acta, 80:1809-1822 (1997)). 

Preparation of Modified Bases and Oligonucleotides 

x 5 Reaction Schemes 1-10 provide illustrative methods for preparing a number of 

modified bases (unsubstituted and 3-substituted pyrazolop^-dlpyrimidines and 5-substituted 
pyrimidines) that are useful in the present invention. The schemes illustrate the preparation 
of phosphoramidite derivatives of the modified bases which can be used in, for example, 
automatic synthesizers for preparing the modified oligonucleotides of the invention. 

20 Reaction Scheme 1 illustrates the preparation of 5-(prop-2-ynyl-4- 

methylbenzoate)-5 , -0-(4,4'-dimethoxytriphenylmethyl)-2'-deoxyuridine3'-[(2-cyanoethyl) 

N,N-diisopropylphosphoramidite] (3) starting from 5-iodo-2'-deoxyuridine. The starting 

materials was treated with prop-2-ynyl 4-methylbenzoate in the presence of Pd(PPh 3 ) 4 -CuI to 

yield the methylbenzoate derivative (1) which was then converted to the 5'-blocked DMTr 

25 derivative (2) and then to the phosphoramidite (3). 
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Reaction Scheme 1 





NC(CH 2 hOP(Cl)N(iPr) 2 



DmTrCI/Pyridine 



DMTrO' 



P-N 




HO 



Reaction Schemes 2 and 3 provide one method for the preparation of 
pyrazolo[3,4-d]pyrimidine phosphoramidites. More particularly, these schemes illustrate the 
preparation of 3-[-4((lE)-l-aza-2-methylprop-l-enyl)-l-((2R,5R)-4- 
{[bis(methylethyl)amino](2-cyanoethoxy)phosphinooxy}-5-{[bis(4- 

m ethoxyphenyl)phen y lmethoxy]methyl}oxolan-2-yl)pyrazolo[3,4-d]pyrimidin-3-yl]prop-2- 
ylnyl 4-methylbenzoate (13; R,=-OCOPhCH 3 ) is synthesized in two portions. 
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Reaction Scheme 2 
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In the first portion (Reaction Scheme 2), (2R,5R)-5-(4-amino-3- 
iodopyrazolo[3,4-d]pyrimidinyl)-2-(hydroxymethyl)oxolan-3-ol (9) was synthesized starting 

5 from l,5-hydropyrazolo[3,4-d]pyrimidin-4-one. The starting material was treated with iodine 
monochloride and P 2 S 5 successively to yield the iodo (4) and thione (5) derivatives 
respectively. The thione (5) was converted to the ethylthio-derivative (6), which was 
combined with a l-chloro-l,2-dideoxy-3,5-di-C>-toluoylribofuranose derivative to yield the 
blocked nucleoside (7). Reaction of compound (7) with sodium methoxide followed by 

1 0 ammonium hydroxide gave hydroxymethyl derivative (8) and compound (9) respectively. 

In the second portion (Reaction Scheme 3), 2, 3-[-4((lE)-l-aza-2-methylprop- 
l-enyl)-l-((2R,5R)-4-{[bis(methylethyl)amino](2-cyanoethoxy)phosphinooxy}-5-{[bis(4- 
methoxyphenyl)phenylmethoxy]methyl}oxolan-2-yl)pyrazolo[3 ) 4-d]pyrimidin-3-yl]prop-2- 

ylnyl 4-methylbenzoate (13 R,=-OCOPhCH 3 ) was prepared starting from compound (9). 
1 5 Compound (9) was reacted with prop-2-ynyl 4-methylbenzoate in the presence of Pd(PPh 3 ) 4 - 
Cul to yield the prop-2-ynyl derivative (10; R,=-OCOPhCH 3 ). The amino group in this 
compound was protected by reaction with N,N dimethylacetamide dimethyl acetal to give 
(ll;R,=-OCOPhCH 3 ). Compound (11; R,=-OCOPhCH 3 ) was converted to the DMTr 
derivative (12; R,=-OCOPhCH 3 ) and then to the phosphoramidite (13; R^-OCOPhCH^. 
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Reaction Scheme 3 




R,=H or-OCOPhCHj 

Reaction Scheme 4 illustrates the preparation of N-{3-[l-((2R,5R)-5-{bis(4- 
methoxyphenyl)phenylmethoxy]methyl} -4- { [bis(methylethyl)amino](2- 
cyanoethoxy)phosphinooxy}oxolan-2-yl)-6-amino-4-oxo(5-hyo^opyrazolo[3,4-d]pyrimidin- 

3-yl)]propyl}-2,2,2-trifluoroacetamide (22). 
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Reaction Scheme 4 
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DMTrCI/Pyridine 
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In Reaction Scheme 4, 6-amino-4-methoxypyrazolo[3,4-d]pyrimidine was 
converted to the iodo derivative (14) which was treated with l-chloro-l,2-dideoxy-3,5-di-0- 
toluoylribofuranose to yield the nucleoside (15). Compound (15) was treated with 
NaOMe/MeOH to hydrolyze the toluoyl groups and yield the unblocked nucleoside (16). 
Treatment of (16) with aqueous sodium hydroxide gave (17) which could be converted to the 
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15 



20 



trifluoro-N-prop-2-ynyl acetamide derivative (18). Compound (18) was reduced with 
hydrogen and a Pd catalyst to yield the trifluoro-N-propylacetamide derivative (19). The 4- 
amino group of compound (19) was protected by treatment with N,N-dimethylformamide 
dimethylacetal to give compound (20) which was converted to DMTr derivative (21) and 
then to the phosphoramidite derivative (22). 

Compounds of Formula 4 (3-[((2R,5R)-5-(6-amino-4-oxo-3-prop-l-ynyl(5- 
hydropyrazolo[3,4-d]pyrimidinyl))-2-{[bis(4-methoxyphenyl)phenylmethoxy]- 
methyl}oxolan-3-yloxy)[bis(methylethyl)amino]phosphinooxy]propanenitrile) (PPPG 
phosphoramidite) can be synthesized using some of the reactions in Reaction Scheme 4. 

_R 6 



N N N 

\ I i 




6- P - N(iPr)2 

} ^(CHsfeCN 




,N(iPr) 2 
v O(CH 2 )2CN 



Formula 4 



Formula 5 



In the conversion of (17) to (18), prop-l-yne is used in place of 2,2,2-trifluoro- 
N-prop-2-ynylacetamide. The 6-amino group can be protected as described for compound 

(20) and the 5'-hydroxyl can be protected with a DMTr group as described for compound 

(21) . Finally the phosphoramidite can be prepared as described for compound (22) to yield 
the compound of Formula 4. In a similar way compounds of Formula 5 where R 6 is 
-OCOPhCH 3 or -NHCOCF 3 . can be prepared using related reactions known in the art. 

Reaction Scheme 5 illustrates the preparation of 3-{[5-(4,6-bis{(lE)-l-aza-2- 
[bis(2-methylpropyl)amino]vinyl}-3-prop-l-ynylpyrazolo[3,4-d]pyrimidinyl)-2-{[bis(4- 
methoxyphenyl)phenylmethoxy]-methyl}oxolan-3-yloxy][ethyl(methylethyl)amino}- 

phosphino}propanenitrile (26). 
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Reaction Scheme 5 




Compound (26) can be synthesized starting from (NH 2 ) 2 PPPA (23) (see, Seela 
& Driller, Helv. Chim. Acta 71 =757-761(1988)). Compound (23) can be converted to the 
bis(methylethyl)amino derivative (24) (Vincent et al, J. Org. Chem., 64:991-997 (1999)), 
followed by reaction first with DMTrCl to yield (25) that can be converted to the 

phosphoramidite (26). 

Reaction Scheme 6 provides the synthesis of protected 3-substituted 

pyrazolo[3,4-d]pyrimidines in which the substituent is a heteroalkyl group. 



T1 



Reaction Scheme 6 




Reaction Scheme 7 provides the synthesis of protected 3-substituted 
pyrazolo[3,4-d]pyrimidine in which the substituent is a methoxypropynyl group. The 3-iodo 

5 diamino derivative (23) was reacted with Pd(PPh 3 ) 4 -CuI, triethylamine in anhydrous DMF 
followed by methyl propargyl ether to yield the 3-methoxypropynyl-2,4- 
diaminopyrazolo[3,4-d]pyrimidine derivative (27). The amino groups in this compound were 
blocked by reaction with N.N-dimethylformamide dimethylacetal to yield (28). The blocked 
nucleoside was first reacted with dimethoxytrityl chloride and then with 2-cyanoethyl 

1 0 diisopropylchlorophosphoramidite to yield the desired blocked phosphoramidite (29). 
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Reaction Scheme 7 




29 

The following reaction schemes provide methods for the preparation of 3- 
heterocycle substituted pyrazolo[3,4-d]pyrimidines. The general methods provided herei 
can be adapted for the preparation of other heterocyclic substituents. 
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Reaction Scheme 8 




1) NaOCH 3 /MeOH 

2) (CH 3 0) 2 CHN(CH 3 ) 2 




1) DMTrO/Pyr 

2) ClpP-^CN 




DMTrO' 



Reaction Scheme 8 provides the synthesis of protected 3-substituted 
pyra2olo[3,4-d]pyrimidine in which the substituent is a furanyl group. Malonitrile was 
reacted with 2-furfuryl chloride in the presence of a base, followed by treatment with 
dimethylsulfide to g.ve the methoxy dinitrile derivative (30). Reaction of (30) with hydrazine 
yielded the substituted pyrazole (31) which was reacted with formamide to give 3-(2- 
furyl)pyrazolo[3 J 4-d] P yrimidine-4-ylamine (32). Base (32) was treated with l-chloro-1,2- 
dideoxy-S.S-di-O-toluoylribofuranose to yield the blocked nucleoside (33). Compound (33) 
was first treated with NaOMe/MeOH to hydrolyze the toluoyl groups to yield the unblocked 
nucleoside which was reacted with N.N-d.methylformarmde d.methylacetal to yield the 
protected nucleoside denvative (34). This denvative was first reacted with dimethoxytntyl 
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chloride and then with 2-cyanoethyl diisopropylchlorophosphoramidite to yield the desired 
blocked phosphoramidite (35). 



Guanidinium 
carbonate 
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2) NaOCH 3 /CH 3 OH 
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3 

Reaction Scheme 9 provides the synthesis of protected 3-substituted 2,4- 
diamino pyrazolo[3,4-d]pyrimidine in which the substituent is a furanyl group. The 
carbonitrile (31) was reacted with guanidinium carbonate to yield 3-(2-furyl)pyrazolo[3,4- 
d]pyrimidine-4,6-diamine (36). This base was treated with l-chloro-l,2-dideoxy-3,5-di-0- 

1 0 toluoylribofuranose to yield the blocked nucleoside which was treated with NaOMe/MeOH 
to hydrolyze the toluoyl groups and yield the unblocked nucleoside derivative (37). The 
latter compound was reacted with N,N-dimethylformamide dimethylacetal to yield the 
protected nucleoside derivative (38). This derivative was first reacted with dimethoxytrityl 
chloride and then with 2-cyanoethyl diisopropylchlorophosphoramidite to yield the desired 

15 blocked phosphoramidite (39). 
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Reaction Scheme 10 



DMTrO 




Reaction Scheme 10 provides one synthesis route to a hydroxy-protected 
HOBU-phosphoramidite. In this scheme, 3-butyn-l-ol is converted to its p-toluoyl ester (40) 
withp-toluoyl chloride in the presence of pyridine. The protected butynol is combined with 
5-iodo-2'-deoxyuridine in the presence of tetrakis(tri P henylphosphine)palladium(0) and 
copper iodide, then protected (as a DMT ether) to form 41, which can then be converted to its 
corresponding phosphoramidite derivative 42. Reagent 42 can be used directly in the 
preparation of modified oligonucleotides. Removal of thep-toluoyl protected group can be 
accomplished according to standard methods. 

In each of the above schemes, one of skill in the art will understand that other 
protecting groups and/or activating groups can be used. Additionally, different non-natural 
base-containing monomers, with different functional groups, can require different blocking 
groups for successful synthesis of the modified, oligonucleotides. A variety of useful 
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protection groups, their synthesis and de-protection methods are described in, for example, 
Beaucage & Iyer, Tetrahedron 48:2223-231 1 (1992). 

The synthesis of oligonucleotides, and modified oligonucleotides can be 
initiated from a solid support containing a cleavable linker to which is attached the first base. 
The oligonucleotides of the invention can be synthesized containing a substituted 
pyrazolo[3,4-d]pyrimidine as the first nucleotide at the 3'-end, using an appropriate 3- 
substituted pyrazolo[3,4-d]pyrimidine. Alternatively, a 5-substituted pyrimidine can be 
attached initially to a solid support. Reaction Scheme 1 1 illustrates a general approach to the 
synthesis of a protected 3-substituted pyrazolo[3,4-d]pyrimidine dA CPG derivative (C). 
Intermediate (A) can be prepared using procedures described in Reaction Scheme 3, while 
(B) and (C) can be synthesized as described in co-pending U.S. application Ser. No. 
09/457,616. 
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In this aspect of the invention, a vanety of solid supports are useful, provided 
the support is compatible with automated oligonucleotide techniques and include, glass, 
polystyrene, nylon, plastic and the like. Additonally, the present invention provides in a 

5 related aspect, a solid support (e.g., controlled pore glass (CPG)) having an attached modified 
base preferably those of formulae Ic. Id, le, If, lib, He, lid, He. More preferably, the attached 
m odified base is a 3-subst.tuted pyrazolo[3,4-d]pynmidine or a 5-substituted pynmidine 
mo ,ety wherein the substituents are selected from the preferred groups desenbed above, and 
further including the novel bases described in more detail below. 

10 The present invention further includes those compositions and methods 

wherein any of the unsubstituted and/or 3-substituted pyrazolo[3,4-d]pyrimidines and/or 5- 
substituted pyrimidines are combined or used in combination with other modified bases 
known in the art. Other modified modified monomenc units have been disclosed Scheit, 
NUCLEOTIDE ANALOGS, John Wiley, New York, 1980; Uhlman and Meyman, Chemical 

15 Review 90:543-584 (1990), Seela and Debelak, Nucl. Acids Res., 28:3224-3232 (2000); 
Balow et al, Nucl. Acids Re,, 26:3350-3357 (1998); Belli et al Nucl. Acids Re,, 24:4660- 
4667 (1996). 

Modified Oligonucleotide Arrays 

20 

In another embodiment of the present invention, modified oligonucleotides are 
used in procedures which utilize arrays of oligonucleotides, such as sequencing by 
hybridization and array-based analysis of gene expression. A variety of arrays are 
contemplated by the present invention including, for example, chip or platform arrays, bead 
25 arrays, liquid phase arrays, ^ip-code' arrays and the like. In sequencing by hybridization, an 
ordered array of oligonucleotides of different known sequences is used as a platform for 
hybridization to one or more test polynucleotides, nucleic acids or nucleic acid populations. 
Determination of the oligonucleotide which are hybridized and alignment of their known 
sequences allows reconstruction of the sequence of the test polynucleotide. Alternatively, 
30 oligonucleotides composing the wild-type sequence and all possible mutant sequences for a 
given region of a gene of interest can be placed on an array. Exposure of the array to DNA or 
RNA from a subject or biological specimen, under hybnd.zation conditions, allows 
determination of wild-type or mutant status for the gene of interest. See, for example, U.S. 
Patent No, 5,492,806; 5,525,464; 5,556,752; and PCT Publications WO 92/10588 and WO 
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96/17957. Both of these techniques require discrimination between related sequences, 
especially at the single-nucleotide level; hence, the enhanced discriminatory properties of the 
modified oligonucleotides of the invention will provide improvements in these techniques. 
Materials for construction of arrays include, but are not limited to, nitrocellulose, glass, 
silicon wafers, optical fibers and other materials suitable for construction of arrays such as 
are known to those of skill in the art. The synthesis of oligonucleotides arrays has been 
described in co-pending U.S. application Ser. No. 09/364,320, and suitable modifications can 
be made to the methods therein for preparing the arrays of the present invention. 

An additional application of the present invention to array technology is in the 
examination of patterns of gene expression in a particular cell or tissue. In this case, 
oligonucleotides or polynucleotides corresponding to different genes are arrayed on a surface, 
and a nucleic acid sample from a particular cell or tissue type, for example, is incubated with 
the array under hybndization conditions. Detection of the sites on the array at which 
hybridization occurs allows one to determine which oligonucleotides have hybridized, and 
hence which genes are active in the particular cell or tissue from which the sample was 
derived. 

Array methods can also be used for identification of mutations or 
polymorphisms, where wild-type and mutant sequences are placed in an ordered array on a 
surface. Hybndization of a polynucleotide sample to the array under stnngent conditions, 
and determination of which oligonucleotides in the array hybridize to the polynucleotide, 
allows determination of whether the polynucleotide possesses the wild-type or the mutant 
sequence. Since many mutant sequences of clinically-relevant genes differ from their wild- 
type counterpart at only one or a few nucleotide positions, the enhanced discriminatory 
powers of the modified oligonucleotides of the invention will provide improvements in 
mutation detection. 

In all of the above-mentioned applications of array technology, the increased 
discriminatory abilities of modified oligonucleotide provide significant improvements in 

sensitivity and resolving power. 

Efficient hybndization, for example in arrays, requires that the capture probes 
contain T m 's in a narrow temperature range. In one embodiment of the invention the T m of 
capture oligonucleotides for use in anays is balanced, eliminating the problem associated 
with GC and AT-nch sequences, by using a combination of 3'-MGBs and modified bases 
(e.g., PPPA and PPPU). Table 1 and Figure 2 illustrate the balancing of the T m of 
pynmidine rich probes with different G/C and A/T base pairs. To balance the T m of the 
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rnmnlementarv Targets 


1* 


TCGGCGGCGT 


2* 


ACAGCGGCGT 


3* 


ACAGCGACGT 


4* 


TCAGTGACGA 


5* 


TCAGTGACAA 


6* 


TCAATGACAG 


7* 


ACAATGATAA 


8* 


C C A AT A AT AA 


9* 


GTAATAATAA 



different probes, PPPA, PU and MGB were used in combination. The purine and pyrimidine- 
rich probes and complementary targets are shown in Table 1 and the T m s of these probes are 
shown in Figure 2. As shown in Figure 2 the T m difference between a GC-rich and an AT- 
rich probe can be more than 50°C 

Table 1 

Sequences of Oligonucleotide Probes and Complementary Targets 

base 

Py/Pu-rich probes pairs 
1* MGB-Q-CGCCGCCG 8 G/C 
2* MGB-Q-CGCCGCTG 7 G/C, 1 A/T 
3* MGB-Q-CGTCGCTG 6 G/C, 2 A/T 
4*.MGB-Q-CGTCACTG 5 G/C, 3 A/T 
5* MGB-Q-TGTCACTG 4 G/C, 4 A/T 
6* MGB-Q-TGTCATTG 3 G/C, 5 A/T 
T MGB-Q-TATCATTG 2 G/C, 6 A/T 
8* . MG B-Q-TATTATTG 1 G/C, 7 A/T 
9 * . M G B -Q-T ATTATT A 8 A/T 

As can be seen in Figure 2, modification with PPPA, PU and a MGB provides 

a probe set having balanced T m s. 

Efficient hybridization for example in arrays, requires that the probes contain 
T m 's in a narrow temperature range. Thus, in related embodiments, the modified 
oligonucleotides described herein can comprise any combination of normal bases, 
unsubstituted pyrazolo[3,4-d]pyrimidine bases (e.g., PPG and PPA), 3-substituted 
pyrazolo[3,4-d]pyrimidines, modified purine, modified pyrimidine, 5-substituted 
pyrimidines, universal bases and a minor groove binder to balance the T m of the 

oligonucleotide. 

The ability to predict and "level" the T m s of multiple modified 
oligonucleotides is particularly useful for the development of oligonucleotide arrays or for 
compositions having a plurality of oligonucleotides. Still further, the ability to predict T m s 
for modified oligonucleotides is useful for establishing appropriate conditions for 
hybridization, renaturation, mapping variations of base compositions of sequences, 
determination of sequence complexity and divergence. A T m prediction algorithm can be 
developed using models that assign thermodynamic contributions to duplex formation for all 
possible individual base pair nearest neighbors. Thermodynamic parameters for 10 
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independent nearest-neighbors of base pairs are available from published sources for 
DNA:RNA hybrids (see Sugimoto et al., Biochem. 34: 1 121 1-1 1216 (1995)) and dsDNA 
(SantaLucia et al, Biochem. 36:10581-10595 (1997)). The use of the nearest neighbor 
parameters to develop programs to calculate oligonucleotide duplex stability in terms of T m 
have been described (see, Schutz & von Ahsen, Biotechniques, 27:1218-22 (1999); Owczarzy 
et al, Biopolymers, 44:217-239 (1997); for PNA/DNA, see Geisen et al, Nucl. Acids Res., 
26:5004-5006 (1998); Blake et al, Bioinformatics, 15:370-5 (1999) and U.S. Patent No. 
5,556,749. . 

The basic principles of thermodynamic and kinetic description of mismatch 
discrimination are well known and are summarized below. 
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A probe-target interaction is an equilibrium reaction that can be described as 



A + B <=> AB where A = probe, B = DNA target 

5 

At the temperature T equilibrium of forward (association) and reverse 
(dissociation) reactions can be described by equilibrium constant K where : 

[AB] 

10 K = 



15 



[A][B] 



in terms of equilibrium thermodynamics 



K= exp(- AH° AB /RT + AS° AB /R); 

AG" AB~ AH°AB — TAS°AB y 

AG° AB = -RTLn(K) 

20 where : AH° AB , AS° AB and AG° AB - enthalpy, entropy and free energy changes for duplex 
AB formation. 

The duplex AB has N base pairs (N-l nearest neighbors) and the nearest- 
neighbor approximation assumes 

25 AH° AB = AH 0 , + AH° 2 +. • . AH 0 ,,., + AH 0 ,,,,, = Z(AH°; + AH° inU ) 

AS" ab = AS°, +AS°2 +..- AS° n ., + AS° init = ZfAS", + AS° init ) 
AG° AB = AG 0 , + AG 3 2 +••• AG? R ., + AG° inil = Z(AG°i + AG° Mi ) 

where : AH° , AS°j and AG° - enthalpy, entropy and free energy changes for each nearest- 
30 neighbor base pairs increment for duplex AB formation, AH°j n it , AS° n it and AG°jnit - 
thermodynamic parameters of initiation of duplex AB formation. 

In the case of total concentration equality, C pr ob e = Curget , an equation for Tm 
in terms of free energy, enthalpy and entropy can be arrived at as shown below: 

35 Z(AH°; + AH° in!l ) 

TM = 



Z(ASTi + AS° ini , ) + RLn[(C p , ob e+C la rg e( )/4J 

Similarly as shown above relationships can be developed when the probe 
40 hybridizes to a target that contains a mismatch, as shown below: 
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A + B* <=> AB* A = probe, B* = mismatched DNA target 



Thermodynamic parameters for mismatched duplex AB* are: K*, AH° AB * 
AS°ab* AG°ab* . In terms of nearest-neighbor parameters for a mismatched base pair 
5 at position "4", for example, involving nearest-neighbor "3" and "4" the discrimination 
parameters AAH° , A AS 0 , AAG° can be expressed as: 

AAH° = AH° AB - AH° AB * = (AH° 3 - AH°* 3 ) + (AH° 4 - AH°* 4 ) = AAH° 3 + AAH° 4 . 
AAS° = AS° AB - AS° AB * = (AS° 3 - AS°* 3 ) + (AS° 4 - AS°* 4 ) = AAS° 3 + AAS° 4 . 
10 AAG° = AG° ab - AGTab* = (ACT 2 - ACT * 3 ) + (AG° 4 - zJG°*J = AAG° 3 + AAG° 4 . 

In one embodiment of the invention, probe sequences are designed to 
determine the thermodynamic nearest-neighbor parameters for oligonucleotides containing 
normal purines, pyrimidines and a minor groove binder CDPI 3 attached to the 3'-end of an 

15 oligonucleotide. In another embodiment thermodynamic nearest-neighbor parameters for 
oligonucleotide containing normal pyrimidines, purines, a 3 '-end attached CDPI3, but 
wherein the guanine is replaced with 5-hydropyrazolo[3,4-d]pyrimin-4-one. These nearest- 
neighbor thermodynamic parameters are used in these two embodiments to calculate the T m s 
of these types of oligonucleotides with and without CDPI 3 - In the case where the CDPI3 was 

20 attached at base 1 at the 3 '-end the new T m prediction algorithm and formula were used as 
derived below. 

Z(AH°i + AH° in it ) 

25 T M = 

£ (AS°i + AST init ) + RLnf(C probe +C iarget )/4/ + AAS° M gb 



The model assumes that a MGB attached to an oligonucleotide has an 
30 additional pure entropic factor AAS° m cb. This value can be calculated according to 

algorithm : 6 bases (5 nearest-neighbor ) from 3 '-prime end are covered by the MGB from 
starting base 44 1" or "2". Each nearest-neighbors covered has unique value AAS 0 iM GB - 
Correction factor "A" are introduced in the case when adenosine can be found in base 
positions 6, 7 or 8. Statistical factor "n" is introduced additionally to factor "A" were n=3 for 
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-AAA-; n=2 for -AA-, -ANA-; n=l for -A-; n=0 for -AN- and -NA- (N is any base other than 
A). A final value AAS° m gb can be determined by the equation 

AAS°mgb = ZAAS?iMGB + A(optional) + R*Ln(N) (optional) 

5 

Tables 2a and 2b contain the nearest-neighbor thermodynamic parameters for 
oligonucleotides containing PPG alone and the entropic contribution of the MGB, 
respectively. In the latter case the contribution of fluorescein and a dark quencher is 
incorporated in the thermodynamic treatment and has generally been shown to be negligible. 
10 [The quencher is part of the linker between the MGB and the oligonulceotide] 

Tables 2a and 2b 

a) Nearest-neighbor parameters for PPG containing oligonucleotides 
b) The entropic contribution of the CDPI3. 

15 

a) 



Number 


5'-3' 


AH° 


AS° 


AG°(65) 


1 


AA 


-7850 


-22.3 


-324 


2 


AT 


-8180 


-23.0 


-411 


3 


AC 


-8450 


-22.6 


-795 


4 


AG 


-6560 


-17.2 


-730 


5 


TA 


-7230 


-21.8 


129 


6 


TT 


-7850 


-22.3 


-324 


7 


TC 


-8720 


-24.0 


-614 


8 


TG 


-6900 


-18.3 


-715 


9 


CA 


7530 


-20.3 


-652 


10 


CT 


-6390 


-17.3 


-548 


11 


CC 


-7860 


-20.1 


-1071 


12 


CG 


-6030 


-14.0 


-1288 


13 


GA 


-10070 


-27.4 


-795 


14 


GT 


-9110 


-23.5 


-1175 


15 


GC 


-13170 


-34.0 


-1665 


16 


GG 


-8080 


-20.2 


-1258 


17 


GCinit 


-48440 


-17.6 


1116 


18 


ATinit 


-1060 


-7.1 


1341 



b) 

Number 5*-3' AAS° 

1 AA 3.408 

2 AT 3.060 

3 AC 1.442 

4 AG 0.750 

5 TA 2.463 

6 TT 3.313 

7 TC 2.870 

8 TG 0.893 

9 CA 0.607 
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10 


CT 


O ICO 

2.253 


11 


CC 


0.905 


12 


CG 


-0.721 


13 


GA 


1.282 


14 


GT 


2.397 


15 


GC 


2.172 


16 


GG 


1.298 



In one preferred embodiment T m s of multiple modified oligonucleotides 
" containing the same number of bases are leveled using an algorithm to select nearest neighbor 

parameters from any combination of normal bases, universal bases, PPA, PPG, PPPA, PPPG, 
5 PU, PC, HOPU, HOBuU, HOBuC, (NH 2 ) 2 PPPA, (NH 2 ) 2 PPPAOH, (NH 2 ) 2 BuPPAOH, 

(NH 2 ) 2 PPAI, HOBuPPG, minor groove binder, fluorophore, quencher and a 

chemilumenescer. 

Alternatively the T m s of multiple modified oligonucleotides are substantially 
leveled for containing the same number of bases plus or minus 1 or 2 bases, with a Tm range 
10 ofabout±2°C. 

In some instances, modified bases are used that improve duplex stability in 
addition to those modified bases that decrease duplex stability. Modified bases that decrease 
duplex stability are well known, e.g. 7-deazaadenine and 7-deazaguanine. 

1 5 Use of Modified Oligonucleotides 

The modified oligonucleotides of the present invention provides numerous 
advantages over unmodified oligonucleotides, including superior mismatch discrimination. 
The modified oligonucleotides of the invention are particularly useful as probes, wherein 
20 their hybridization to a target sequence is detected, or as primers, wherein their hybridization 
to a target sequence is followed by polynucleotide synthesis initiated from the 3' terminus of 
the modified oligonucleotide, and the synthesized product {i.e., the extension product) is 
detected. 

The modified oligonucleotides of the present invention are useful in other 
25 techniques in which hybridization of an oligonucleotide to another nucleic acid is involved. 
These include, but are not limited to, techniques in which hybridization of an oligonucleotide 
to a target nucleic acid is the endpoint; techniques in which hybridization of one or more 
oligonucleotides to a target nucleic acid precedes one or more polymerase-mediated 
elongation steps which use the oligonucleotide as a primer and the target nucleic acid as a 
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template; techniques in which hybridization of an oligonucleotide to a target nucleic acid is 
used to block extension of another pnmer; techniques in which hybridization of an 
oligonucleotide to a target nucleic acid is followed by hydrolysis of the oligonucleotide to 
release an attached label; and techniques in which two or more oligonucleotides are 
> hybridized to a target nucleic acid and interactions between the multiple oligonucleotides are 
measured. Conditions for hybridization of oligonucleotides, and factors which influence the 
degree and specificity of hybridization, such as temperature, ionic strength and solvent 
composition, are well-known to those of skill in the art. See, for example, Sambrook et al, 
supra- Ausubel, et al, supra; M.A Innis et al (eds.) PCR Protocols, Academic Press, San 
0 Diego, 1990; B.D. Hames et al. (eds.) Nucleic Acid Hybridisation: A Practical Approach, 
IRL Press, Oxford, 1985; and van Ness et al. (1991) Nucleic Acids Res. 19:5143-5151. 

Hybndization of probes and/or primers to target sequences proceeds according 
to well-known and art-recognized base-pairing properties, such that adenine base-pairs with 
thymine or uracil, and guanine base-pairs with cytosine. The property of a nucleotide that 
5 allows it to base-pair with a second nucleotide is called complementarity. Thus, adenine is 
complementary to both thymine and uracil, and vice versa; similarly, guanine is 
complementary to cytosine and vice versa. An oligonucleotide which is complementary 
along its entire length with a target sequence is said to be perfectly complementary, perfectly 
matched, or fully complementary to the target sequence, and vice versa. An oligonucleotide 
20 and its target sequence can have related sequences, wherein the majonty of bases in the two 
sequences are complementary, but one or more bases are noncomplementary, or mismatched. 
In such a case, the sequences can be said to be substantially complementary to one another. 
If the sequences of an oligonucleotide and a target sequence are such that they are 
complementary at all nucleotide positions except one, the oligonucleotide and the target 
25 sequence have a single nucleotide mismatch with respect to each other. 

The modified pyrazolo[3,4-d]pyrimidine nucleotides of the invention retain 
the base-pairing specificity of their naturally-occumng analogues; PPPG analogues are 
complementary to cytosine, while PPPA analogues are complementary to thymine and uracil. 
The PPPG and PPPA analogues not only have a reduced tendency for so-called "wobble 
30 pairing with non-complementary bases, compared to guanine and adenine, but the 3- 

substituted groups increase binding affinity in duplexes. Similarly, modified pyridines 
hybridize specifically to their naturally occurring counter partners. 

Conditions for hybridization are well-known to those of skill in the art and can 
be varied within relatively wide limits. Hybridization stringency refers to the degree to 
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which hybridization conditions disfavor the formation of hybrids containing mismatched 
nucleotides, thereby promoting the formation of perfectly matched hybrids or hybrids 
containing fewer mismatches; with higher stringency correlated with a lower tolerance for 
mismatched hybrids. Factors that affect the stringency of hybridization include, but are not 
limited to, temperature, pH, ionic strength, concentration of organic solvents such as 
formamide and dimethylsulfoxide and chaotropes. As is well known to those of skill in the 
art, hybridization stringency is increased by higher temperatures, lower ionic strengths, and 
lower solvent concentrations. See, for example, Ausubel et al, supra; Sambrook et al, 
supra; M.A. Innis et al. (eds.) PCR Protocols, Academic Press, San Diego, 1990; B.D. 
Hames et al. (eds.) Nucleic Acid Hybridisation: A Practical Approach, IRL Press, Oxford, 
1985; and van Ness et al., (1991) Nucleic Acids Res. 19:5143-5151. 

Thus, in the formation of hybrids (duplexes) between an oligonucleotide and 
its target sequence, the oligonucleotide is incubated in solution, together with a 
polynucleotide containing the target sequence, under conditions of temperature, ionic 
strength, pH, etc, that are favorable to hybridization, i.e., under hybridization conditions. 
Hybridization conditions are chosen, in some circumstances, to favor hybridization between 
two nucleic acids having perfectly-matched sequences, as compared to a pair of nucleic acids 
having one or more mismatches in the hybridizing sequence. In other circumstances, 
hybridization conditions are chosen to allow hybridization between mismatched sequences, 
favoring hybridization between nucleic acids having fewer mismatches. 

The degree of hybridization of an oligonucleotide to a target sequence, also 
known as hybridization strength, is determined by methods that are well-known in the art. A 
preferred method is to determine the T m of the hybrid duplex. This is accomplished, as 
described supra, by subjecting a duplex in solution to gradually increasing temperature and 
monitoring the denaturation of the duplex, for example, by absorbance of ultraviolet light, 
which increases with the unstacking of base pairs that accompanies denaturation. T m is 
generally defined as the temperature midpoint of the transition in ultraviolet absorbance that 
accompanies denaturation. Alternatively, if T m s are known, a hybridization temperature (at 
fixed ionic strength, pH and solvent concentration) can be chosen that it is below the T m of 
the desired duplex and above the T m of an undesired duplex. In this case, determination of 
the degree of hybridization is accomplished simply by testing for the presence of hybridized 
probe. 

If a probe comprises a detectable label, assays for hybridized probe are usually 
designed to detect the presence of label in duplex material. This can be accomplished, for 
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example, by specifically selecting duplex material, specifically destroying single-stranded 
material, or utilizing some combination of these methods. For example, hybridization 
reaction mixtures can be subjected to high-stringency conditions and/or single strand-specific 
nucleases; or duplexes can be punfied by affinity techniques specific for double-stranded, as 
opposed to single-stranded, nucleic acids. In a preferred embodiment of the invention, 
duplexes are detected by release of label from a probe under conditions in winch label xs 
released only when the probe is in a duplex. Another embodiment requires the seperation of 
the label and quenchers when hybridized to the target. 

Detectable labels or tags suitable for use with nucleic acid probes are well- 
known to those of skill in the art and include, but are not hmhed to, radioactive isotopes, 
chromophores, fluorophores, chemiluminescent and electrochemiluminescent agents, 
magnetic labels, microspheres, colloidal metal(Taton et al, Science 289:1757-1760 (2000)), 
unmunologic labels, Hgands and enzymatic labels. Suitable labels also include mass labels 
and those used in deconvolution of combinatorial chemistry Hbranes, for example, tags that 
can be recognized by high performance liquid chromatography (HPLC), gas chromatography, 
mass spectrometry, optical imaging fibers, surface plasmon resonance, correlation 
spectroscopy, nanotechnology (Guetence et al, J. Chromatogr. B. Biomed. Sci. Appl. 

739:139-150 (2000)) and the like. 

Methods for labeling of oligonucleotides are well-known to those of skill in 
the art and include, for example, chemical and enzymatic methods. By way of example, 
methods for incorporation of reactive chemical groups into oligonucleotides, at specific sites, 
are well-known to those of skill in the art. Oligonucleotides containing a reactive chemical 
group located at a specific site, can be combined with a label attached to a complementary 
reactive group (e.g., an oligonucleotide containing a nucleophilic reactive group can be 
reacted with a label attached to an electrophilic reactive group) to couple a label to a probe by 
chemical techniques. Exemplary labels and methods for attachment of a label to an 
oligonucleotide are described, for example, in U.S. Patent No. 5,824,796; U.S. Patent No. 
5 210 015- Kess.er (ed.), Nonradioactive Labeling and Detection of Biomolecules. Spnnger- 
Verlag, Berlin, 1 992; Kncka (ed.) Nonisotopic DNA Probe Technics, Academic Press, San 
Diego 199> Howard (ed.) Methods in Nonradioactive Detection, Appleton & Lange, 
Norwalk 1993. Non-specific chem.cal labeling of an oligonucleot.de can be achieved by 
combining the oligonucleotide with a chemical that reacts, for example, with a particular 
functional group of a nucleotide base, and simultaneously or subsequently reacting the 
oligonucleotide with a label. See, for example, Draper et al. (1980) Biochemistry 19:1774- 
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1781. Enzymatic incorporation of label into an oligonucleotide can be achieved by 
conducting enzymatic modification or polymerization of an oligonucleotide using labeled 
precursors, or by enzymatically adding label to an already-existing oligonucleotide. See, for 
example, U.S. Patent No. 5,449,767. Examples of modifying enzymes include, but are not 
limited to, DNA polymerases, reverse transcriptases, RNA polymerases, etc. Examples of 
enzymes which are able to add label to an already-existing oligonucleotide include, but are 
not limited to, kinases, terminal transferases, ligases, glycosylases, etc. 

If an oligonucleotide is capable of acting as a primer, the degree of 
hybridization of the oligonucleotide can also be determined by measuring the levels of the 
extension product of the primer. In the case, either the primer can be labeled, or one or more 
of the precursors for polymerization (normally nucleoside triphosphates) can be labeled. 
Extension product can be detected, for example, by size {e.g., gel electrophoresis), affinity 
methods, or any other technique known to those of skill in the art. 

Primer extension ("minisequencing", "genetic bit analysis") assays are 
commonly used for SNP typing and have the potential to be used in other genotyping and 
mutation screening applications (Pastinen T. et al., Genome Res., 10:1031-42 (2000)). In 
certain embodunents modified bases and minor groove binders improve primer extension 
assays in several ways. The added duplex stability provided by MGB, or 5-substituted 
pyrimidine or 3-substituted pyrazolo[3,4-d]pyrimidine enables extensions to be performed at 
elevated temperatures. This is advantageous because problematic secondary structures in 
target molecules are eliminated at elevated temperatures. Also, hybridization of target to 
primer is faster at higher temperature. Thermostable polymerases such as Taq polymerase 
and Bst DNA polymerase may be used in such reactions. 

Furthermore, MGBs and modified bases improve the specificity of assays by 
eliminating one class of false postitive signals. Primer sequences that form hairpin structures 
or homodimers are prone to template-independent extension (the 5' end of the primer 
functions as template), resulting in false positive signal. MGBs and modified bases on 
"templates" inhibit extension by DNA polymerases. Thus, MGBs on the 5' end, or modified 
bases on the 5' end or middle of a primer, can prevent extension (false positives) from primer 
hairpins or primer dimers. Finally, PPG can be used to eliminate non-canonical structures 
formed by G-rich oligonucleotides, enabling primer extension assays in such sequences. 

Other assays in which the present modified oligonucleotides are particularly 
useful are described in co-pending application Ser. No. 09/054,832. 
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Still other amplification assays in which modified oligonucleotides are useful 
include the amplification assays based on the invasive cleavage of oligonucleotide probes by 
flap endonucleases (Lyamichev et al., Nature BiotechnoL, 17:292-296 (1999) and Olson, et 
al., High-Throughput Gene Expression Monitoring with the Invader® Assay, Poster, Society 
for Biomolecular Screening Conference, Vancouver, British Columbia, Canada, 2000); self- 
sustained sequence replication type assays (Mueller et al, Histochem. Cell Biol, 108:431-437 
(1997)) and the like. Surprisingly, non-natural bases can be substituted in both the invader 
and genomic probes of a cleavase-based assay. These modifications include but are not 
limited to pyrazolo[3,4-d]pyrimidines, 3-substituted pyrazolo[3,4-d]pyrimidines and 5- 
substituted pyrimidines. Non-natural backbones are also included such as monomers used in 
peptide nucleic acids, locked nucleic acids etc. Still further, the modified oligonucleotides 
can have attached minor groove binders, fluorophores, quenchers, and the like. One of skill 
in the art will appreciate that chimeras can also be used to allow optimal enzyme activity and 
performance. 

In the present invention, marked improvements are unexpectedly shown in 
cleavase-based assays, when modified bases were substituted for normal bases in both the 
invader and genomic probes. Thus, the use of the modified bases such as 
hydoxypropynylPPA (HOPPPA), (NH 2 ) 2 PPPAOH and 3-iododiaminoPPA allowed one to 
decrease the length of both the invader and genomic probes, yet provide improved assay 
performance. In another embodiment non-natural bases are also incorporated in the cassette 
probe. 

In view of the above, the present invention provides in one aspect, a method 
for distinguishing polynucleotides with related sequences, the method comprising: 

(a) contacting a modified oligonucleotide having a defined sequence 
comprising at least one 3-substituted pyrazolo[3,4-d]pyrimidine or 5-substituted pyrimidine 

' in place of a purine or pyrimidine base with at least two polynucleotides, wherein one of the 
polynucleotides has a target sequence that is perfectly complementary to the modified 
oligonucleotide and at least one of the other polynucleotides has a target sequence with at 

least one base mismatch; and 

(b) determining the degree of hybridization between the modified 

oligonucleotide and each of the polynucleotides. 

Preferably, at least one of the other polynucleotides has a target sequence with 
one or two base mismatches, more preferably only one base mismatch. 
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As noted above, a target sequence refers to a nucleotide sequence which 
comprises a site of hybridization for a probe or a primer. Target sequences can be found in 
any nucleic acid including, but not limited to, genomic DNA, cDNA, RNA and any amplified 
product thereof, and can comprise a wild-type gene sequence, a mutant gene sequence, a non- 
coding sequence, a regulatory sequence, etc. A target sequence will generally be less than 
100 nucleotides, preferably less than 50 nucleotides, and most preferably, less than 21 

nucleotides in length. 

The modified oligonucleotides used in this aspect of the invention are 
essentially modified probes and the polynucleotides can be distinguished by determining 
which polynucleotides hybridizes to the modified probe. The modified probes can be labeled 
with any detectable label, or the probe can have the capacity to become labeled either before 
or after hybridization, such as by containing a reactive group capable of association with a 
label or by being capable of hybridizing to a secondary labeled probe, either before or after 
hybridization to the target. Conditions for hybridization of nucleic acid probes are well- 
known to those of skill in the art. See, for example, Sambrook et al, supra; Ausubel et al, 
supra; Innis et al., supra; Hames et al. supra; and van Ness et al, supra. 

Hybridization can be assayed (i.e., hybridized nucleic acids can be identified) 
by distinguishing hybridized probe from free probe by one of several methods that are well- 
known to those of skill in the art. These include, but are not limited to, attachment of target 
nucleic acid to a solid support, either directly or indirectly (by hybridization to a second, 
support-bound probe or interaction between surface-bound and probe-conjugated ligands) 
followed by direct or indirect hybridization with probe, and washing to remove unhybridized 
probe; determination of nuclease resistance; buoyant density determination; affinity methods 
specific for nucleic acid duplexes (e.g., hydroxyapatite chromatography); interactions 
between multiple probes hybridized to the same target nucleic acid; etc. See, for example, 
Falkow et al, U.S. Patent No. 4,358,535; Urdea et al, U.S. Patent Nos. 4,868,105 and 
5,124,246; Freifelder, Physical Biochemistry, Second Edition, W. H. Freeman & Co., San 
Francisco, 1982; Sambrook, et al, supra; Ausubel* al, supra; Hames et al. supra; and 
other related references. The duplex-stabilizing capability of MGB-modif,ed oligonucleotide 
conjugates makes hybridization possible under more stringent conditions, wherein potentially 
occluding secondary structure in the target nucleic acid can be minimized. Accordingly, such 
MGB-modif,ed oligonucleotides are particularly preferred in this aspect of the invention. 

In a related aspect, the present invention provides a method for detecting the 
presence of a target sequence in a polynucleotide, the method comprising: 
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(a) incubating a polynucleotide to be tested for the presence of the target 
sequence with a modified oligonucleotide having a sequence that is substantially 
complementary to the target sequence under hybridization conditions; and 

(b) identifying hybridized nucleic acids; 

wherein the modified oligonucleotide comprises at least one 3-substituted 
pyrazolo[3,4-(/]pyrimidine in place of a purine residue. 

Preferably, the modified oligonucleotide is a labeled probe and has at least two 
3-substituted pyrazolo[3 ) 4-d]pyrimidine bases. In this group of embodiments, a labeled 
probe is hybridized to a target and/or an extension product of a target, and a change in the 
physical state of the label is effected as a consequence of hybridization. As used herein, a 
"probe" is a nucleic acid molecule that is capable of hybridizing to a target sequence in a 
second nucleic acid molecule. By way of example, one assay of this type, the hydrolyzable 
probe assay, takes advantage of the fact that many polymerizing enzymes, such as DNA 
polymerases, possess intnnsic 5'-3' exonucleolytic activities. Accordingly, if a probe is 
hybridized to a sequence that can serve as a template for polymerization (for instance, if a 
probe is hybridized to a region of DNA located between two amplification primers, during 
the course of an amplification reaction), a polymerizing enzyme that has initiated 
polymerization at an upstream amplification primer is capable of exonucleolytically digesting 
the probe. Any label attached to such a probe will be released, if the probe is hybridized to 
its target and if amplification is occurring across the region to which the probe is hybridized. 
Released label is separated from labeled probe and detected by methods well-known to those 
of skill in the art, depending on the nature of the label. For example, radioactively labeled 
fragments can be separated by thin-layer chromatography and detected by autoradiography; 
while fluorescently-labeled fragments can be detected by irradiation at the appropriate 
excitation wavelengths with observation at the appropriate emission wavelengths. See, e.g., 

U.S. Patent No. 5,210,015. 

In a variation of this technique, a probe contains both a fluorescent label and a 
quenching agent, which quenches the fluorescence emission of the fluorescent label. In this 
case, the fluorescent label is not detectable until its spatial relationship to the quenching agent 
has been altered, for example by exonucleolytic release of the fluorescent label from the 
probe. Thus, prior to hybridization to its target sequence, the dual fluorophore/quencher 
labeled probe does not emit fluorescence. Subsequent to hybridization of the 
fluorophore/quencher-labeled probe to its target, it becomes a substrate for the exonucleolytic 
activity of a polymerizing enzyme which has initiated polymerization at an upstream primer. 
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Exonucleolytic degradation of the probe releases the fluorescent label from the probe, and 
hence from the vicinity of the quenching agent, allowing detection of a fluorescent signal 
upon irradiation at the appropriate excitation wavelengths. This method has the advantage 
that released label does not have to be separated from intact probe. Multiplex approaches 
utilize multiple probes, each of which is complementary to a different target sequence and 
carries a distinguishable label, allowing the assay of several target sequences simultaneously. 

The use of MGB-modified oligonucleotide conjugates in this and related 
methods allows greater speed, sensitivity and discriminatory power to be applied to these 
assays. In particular, the enhanced ability of MGB-modified oligonucleotide conjugates to 
allow discrimination between a perfect hybrid and a hybrid containing a single-base 
mismatch will facilitate the use of hydrolyzable probe assays in the identification of single- 
nucleotide polymorphisms and the like. One of skill in the art will appreciate that 
compositions and methods, such as those of the invention, that are capable of discriminating 
single-nucleotide mismatches will also be capable of discriminating between sequences that 
have 2, 3, 4, 5, or even 6 or more mismatches with respect to one another. 

In yet another related aspect, the present invention provides a method for 
primer extension, the method comprising incubating a polynucleotide containing a target 
sequence with one or more oligonucleotide primers complementary to the target sequence, in 
the presence of a polymerizing enzyme and nucleotide substrates under conditions favorable 
for polymerization; wherein at least one of the oligonucleotide primers contains a 3- 
substituted pyrazolo[3,4-</]pyrimidine or a 5-substituted pyrimidine base in place of a purine 

or pyrimidine base. 

Amplification procedures are those in which many copies of a target nucleic 
acid sequence are generated, usually in an exponential fashion, by sequential polymerization 
and/or ligation reactions. In addition to the more traditional amplification reactions discussed 
below, the present invention is useful in amplifications involving three-way junctures (see, 
WO 99/37085), signal amplification (see Capaldi, et al., Nuc. Acids Res., 28:E21 (2000)), T7 
polymerases, reverse transcriptase, RNase H, RT-PCR, Rolling Circles, cleavase and the like. 

Many amplification reactions, such as PCR, utilize reiterative primer- 
dependent polymerization reactions. A primer is a nucleic acid that is capable of hybridizing 
to a second, template nucleic acid and that, once hybndized, is capable of being extended by 
a polymerizing enzyme (in the presence of nucleotide substrates), using the second nucleic 
acid as a template. Polymenzing enzymes include, but are not limited to, DNA and RNA 
polymerases and reverse transcriptases, etc. Conditions favorable for polymerization by 
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different polymerizing enzymes are well-known to those of skill in the art. See, for example, 
Sambrook^/.,™^; Ausubel, et al., supra; Innis et al. supra. Generally, in order to be 
extendible by a polymerizing enzyme, a primer must have an unblocked 3'-end, preferably a 
free 3' hydroxyl group. The product of an amplification reaction is an extended primer, 
wherein the primer has been extended by a polymerizing enzyme. 

Thus, in one embodiment of the invention, the methods and compositions 
disclosed and claimed herein are useful in improved amplification reactions such as PCR. 
See, e.g., U.S. Patents 4,683,202; 4,683,195 and 4,800,159; Mullis and Faloona, supra; and 
Saiki et aL supra. The polymerization step of PCR is most often catalyzed by a thermostable 
polymerizing enzyme, such as a DNA polymerase isolated from a thermophilic bacterium, 
because of the elevated temperatures required for the denaturation step of PCR. As discussed 
supra, one of the problems heretofore associated with the practice of PCR is the requirement 
for relatively long oligonucleotide primers, having sufficient hybrid stability to serve as 
primers at the elevated temperatures under which PCR is conducted. Modified 
oligonucleotides and especially MGB-modified oligonucleotide conjugates are useful as 
primers in amplification reactions such as PCR, as the modified bases and MGBs increase 
hybrid stability, thereby significantly extending the lower limit of useful primer length. In 
addition, MGB-modified oligonucleotide conjugates are useful in specialized PCR protocols 
wherein reduced primer length is desirable. These include, but are not limited to, differential 
display, in which optimal primer length is below 10 nucleotides, random amplification of 
polymorphism in DNA (RAPD) techniques, and amplification length polymorphism 
analyses. Liang et al, supra; Williams et al., supra. 

The modified oligonucleotides of the present invention are applicable to any 
type of assay or procedure in which PCR or a related amplification technique is used, 
including, but not limited to, hydrolyzable probe assays, priming with allele-specific 
oligonucleotides (ASOs), fragment length polymorphism analysis, single nucleotide 
polymorphism (SNP) analysis and microsatellite analysis, for example. These and other 
techniques are useful in gene mapping, in the identification and screening of disease-related 
genes, and in pharmacogenetics, to name just a few applications. 

In still another related aspect, the present invention provides a method for 
determining the nucleotide sequence of a polynucleotide, the method comprising: 

(a) incubating the polynucleotide with a modified oligonucleotide array under 
hybridization conditions; and 
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(b) determining to which of the modified oligonucleotides in the array the 

polynucleotide hybridizes; 

wherein the modified oligonucleotides comprise at least one 3-substituted 

pyrazolo[3,4-rflpyrimidine in place of a purine base. 

In these procedures, an ordered array comprising a plurality of modified 
oligonucleotides of different known sequences is used as a platform for hybridization to one 
or more test polynucleotides, nucleic acids or nucleic acid populations. Determination of the 
oligonucleotides which are hybridized and alignment of their known sequences allows 
reconstruction ofthe sequence of the test polynucleotide. See, for example, U.S. Patent Nos. 
5 492 806; 5,525,464; 5,556,752; and PCT Publications WO 92/10588 and WO 96/17957. 
Materials for construction of arrays include, but are not limited to, nitrocellulose, glass, 
silicon wafers, optical fibers and other materials suitable for construction of arrays such as 

are known to those of skill in the art. 

A major problem with current array-based sequencing and analysis methods is 
that the different oligonucleotides in an array will each have a different T m . Hence, it is 
difficult to determine the stringency conditions that will provide maximum sensitivity, while 
retaining the ability to distinguish single-base mismatches. This is a particularly important 
consideration for most, if not all, applications of array technology. Use of modified 
oligonucleotides and/or MGB-modified oligonucleotide conjugates in array-based sequencing 
and analysis techniques provides a solution to this problem. Surprisingly, conjugation of a 
MGB to a modified oligonucleotide makes its T m relatively independent of base composition. 
Thus for a population of modified oligonucleotides and MGB-modified oligonucleotide 
conjugates of a given length, the T m for a perfect hybrid falls within a relatively narrow 
temperature range regardless of sequence. At the same time, the T m for a single nucleotide 
mismatch is well below the T m ofthe perfect match. Thus, arrays designed such that all 
modified oligonucleotides are the same length and are optionally present as their MGB 
conjugates exhibit minimal variation in T m among the different oligonucleotides in the array, 
enabling more uniform hybridization conditions for the entire array. A further advantage to 
the use of modified oligonucleotides and MGB-modified oligonucleotide conjugates ,n these 
techniques is that it provides greater sensitivity, by allowing the use of shorter 
oligonucleotides, at higher temperatures (and hence higher stringency), while retaining 

single-nucleotide resolution. 

An additional application ofthe present invention to array technology is in the 
examination of patterns of gene expression in a particular cel. or tissue (see, generally, Eisen, 
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et al METHODS IN ENZYMOLOGY, 303:179-205 (1999)). In this case, modified 
oligonucleotides or polynucleotides corresponding to different genes are arrayed on a surface, 
and a nucleic acid sample from a pabular cell or tissue type, for example, is incubated with 
the array under hybridization conditions. Detection of the sites on the array at which 
5 hybridization occurs allows one to determine which modified oligonucleotides have 

hybridized, and hence which genes are active in the particular cell or tissue from which the 

sample was derived. 

Array methods can also be used for identification of mutations or 
polymorphisms, where wild-type and mutant sequences are placed in an ordered array on a 
10 surface (see, Hacia, et al., J. Mol. Genet., 36:730-736 (1999)). Hybridization of a 

polynucleotide sample to the array under stringent conditions, and determination of which 
oligonucleotides in the array hybridize to the polynucleotide, allows determination of whether 
the polynucleotide possesses the wild-type or the mutant sequence. The increased 
discriminatory abilities of MGB-oligonucleotide conjugates are especially useful m this 

1 5 application of array technology. 

Accordingly, the present invention provides a method for identifying a 
mutation in a target sequence of a gene of interest, the method comprising: 

(a) incubating a polynucleotide comprising the target sequence with an array 
of oligonucleotides of different sequences, wherein the different sequences include the wild- 
20 type target sequence and different mutant target sequences, under hybridation conditions; 

^ (b ) determining which of the oligonucleotides in the array become hybridized 

to the polynucleotide; 

wherein one or more purine residues in a plurality of the oligonucleotides are 

25 replaced with a 3-substituted pyrazolo[3,4-</]pyrimidine. 

In yet another related aspect, the present invention provides a method 
determining the nucleotide sequence of a target sequence in a polynucleotide, the method 
comprising: 

(a) contacting a polynucleotide composing the target sequence with at least 
30 two oligonucleotides of known sequence wherein one or more purine residues of the 

oligonucleotides are replaced by a 3-substituted pyrazoloP.^pynmidine, and wherein one 
of the at least two oligonucleotides has a sequence that is perfectly complementary to the 
target sequence and at least one other of the oligonucleotides has a related target sequence 
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and incubating each of the oligonucleotides with the polynucleotide under hybridization 
conditions; and 

(b) determining the degree of hybridization between each of the 
oligonucleotides and the polynucleotide. 

In one embodiment, a collection of all possible n-mer oligonucleotides (where 
n is an integer less than about 10) are used in a hydrolyzable probe assay to determine a 
nucleotide sequence. Each oligonucleotide is uniquely labeled (and preferably modified) and 
analysis of released label indicates which of the oligonucleotides has hybridized to the target 
sequence. Alignment of the sequences of the oligonucleotides which have hybridized 

provides the nucleotide sequence. 

Modified oligonucleotides, and more preferably modified oligonucleotide- 
MGB conjugates are also useful in primer-dependent methods of DNA sequencing, such as 
the chain-termination method and its derivatives, originally described by Sanger et al., supra. 
Use of MGB-modified oligonucleotide conjugates in chain-termination sequencing allows the 
use of shorter primers at higher stringency, and enables a greater degree of mismatch 
discrimination during sequencing. Examples include, but are not limited to, a search for 
genes.sharing a short region of homology (on the order of a few amino acids) and sequencing 
in a region in which very little existing sequence information is available. MGB- 
oligonucleotide conjugates are useful in such short primer sequencing techniques. 

In still another embodiment, the present invention provides a method for 
examining gene expression in a cell, the method comprising: 

(a) incubating a population of polynucleotides representative of the genes 
expressed in the cell with an oligonucleotide array comprising a plurality of modified 
oligonucleotides of different sequences under hybridization conditions, and 

(b) determining which of the modified oligonucleotides in the array become 

hybridized to polynucleotides; 

wherein said modified oligonucleotides comprise at least one 3-substituted 

pyrazolo[3,4-c/]pyrimidine in place of a purine. 

In one group of embodiments, the method is carried out with a plurality of 
expressed genes from multiple cell-types or tissues. The genes are preferably labeled with 
different detection labels, then hybridized to an array at the same time and monitored via the 
label to determine each gene's expression pattern. 

In addition to the assays and diagnostic methods described above, the 
modified oligonucleotides and modified bases described herein will have utility in antisense 
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technologies. Antisense oligonucleotides are known to selectively inhibit gene expression 
and provide a genetic approach for disease treatment and prevention (Smith et al, Int. J. 
Oncol, 17:841-850 (2000)). Other modified oligonucleotides have been used to improve the 
performance of antisense oligonucleotides ( Zhang et al, Nat. BiotechnoL, 18:862-867 
5 (2000); Flanagan et al, Nat, BiotechnoL, 14:1 139-1 145 (1996)). Accordingly, another 

embodiment of the present invention is the use of the 5-substituted pyrimidines, unsubstituted 
pyrazolo[3,4-d]pyrimidines and 3-substituted pyrazolo[3,4-d]pyrimidine as monomers, either 
alone or in any combination, in the synthesis of antisense oligomers. In another embodiment 
the modified bases described herein can be used as monomers in an oligonucleotide to 
10 decrease enzymatic degradation of the antisense oligomers. 



New Modified Bases 



In another aspect, the present invention provides a number of new modified 
15 bases. These bases have the general formula: 



Y 2 — i B 




wherein Z 1 is a member selected from the group consisting of H, F and OR a wherein R a is a 
member selected from the group consisting of H, (Ci-C8)alkyl and a hydroxy protecting 
group; Z 2 is a member selected from the group of H, (Ci-C 8 )alkyl, or is optionally combined 

20 with Z 1 to form a five- to seven-membered ring, having from one to three heteroatoms 

selected from the group consisting of O, S and N; Y 1 is a member selected from the group 
consisting of OH, a protected hydroxy group and O-P 1 , wherein P 1 is a phosphoramidite or 
H-phosphonate group; Y 2 is a member selected from the group consisting of OH, a protected 
hydroxy group and O-P 2 , wherein P 2 is a phosphoramidite, H-phosphonate, monophosphate, 

25 diphosphate or triphosphate; and B is a modified nucleotide selected from the group 
consisting of: 

O X 11 O R 12 X 11 R 12 

tY" xY" xY - x 




i T " .!. 



N 



wherein X" and X 12 are each independently selected from the group consiting of H, NH 2 and 
a protected amino group; each R 11 is independently selected from, the group consisting of 
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protected or unprotected forms of 3-hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3- 
methoxyprop-l-ynyl, 4-hydroxy- 1 -butynyl, 4-amino-l-butynyl and 3-(hydroxymethyl)-4- 
hydroxy-l-butynyl; and each R 12 is independently selected from the group consisting of 
heterocyclyl(C,-Ci2)alkyl,heterocyclyl(C 2 -Ci2)alkenyl, heterocyclyl(C 2 -C 12 )alkynyl ) 
heterocyclyl, 3-hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3-methoxyprop-l-ynyl, 4-hydroxy- 
1 -butynyl, 4-amino-l-butynyl and 3-(hydroxymethyl)-4-hydroxy-l -butynyl, with the proviso 
that R 12 is other than 2-pyridylethynyl. 

In one group of embodiments, B is 



X 11 R 12 




N 



Preferably, X n is NH 2 and X 12 is H. More preferably, X u is NH 2 , X 12 is H, Y 1 is O-P 1 , Y 2 is 
a protected hydroxy, Z 1 is H, and R 12 is selected from 3-hydroxyprop-l-ynyl, 3-aminoprop-l- 
ynyl, 4-hydroxy-l-butynyl, 4-amino-l-butynyl, 3-(hydroxymethyl)-4-hydroxy-l -butynyl, 3- 
methoxyprop-l-ynyl, 2-furanyl, 3-furanyl, 2-thienyl, 3-thienyl, 2-pyrrolyl, 3-pyrrolyl, 1,3- 
isoxazol-4-yl, l,3-isoxazol-5-yl, l,3-isoxazol-2-yl, 2-imidazolyl, 4-imidazolyl, 2-pyridyl, 3- 
1 5 pyridyl and 4-pyridyl. In particularly preferred embodiments, Y 1 is -0-[(2-cyanoethyl) N,N- 
diisopropylphosphoramidite] and Y 2 is -0-(4,4'-dimethoxytrityl). 
In another group of embodiments, B is 

O R 12 

^' 
i 



Preferably, X 12 is NH 2 or H. More preferably, X 12 is NH 2 or H, Y 1 is O-P 1 , Y 2 is a protected 
20 hydroxy, Z 1 is H, and R 12 is selected from 3-hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3- 
(hydroxymethyl)-4-hydroxy-l-butynyl, 3-methoxyprop-l-ynyl, 2-furanyl, 3-furanyl, 2- 
thienyl, 3-thienyl, 2-pyrrolyl, 3-pyrrolyl, l,3-isoxazol-4-yl, l,3-isoxazol-5-yl, l,3-isoxazol-2- 
yl, 2-imidazolyl, 4-imidazolyl, 2-pyridyl, 3-pyridyl and 4-pyridyl. In particularly preferred 
embodiments, Y 1 is -0-[(2-cyanoethyl) N,N-diisopropylphosphoramidite] and Y 2 is - 

25 0-(4,4'-dimethoxytrityl). 

In still other preferred embodiments, the modified oligonucleotides of the 
invention have at least one base of formula III (including those in which the acetyl protecting 
groups are removed, and including other protected forms and activated forms thereof). 
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P <A 



HN 

HzN-^n^N 



Preparation of compounds of formula III (including compounds wherein the 6- 
amino group is protected and the hydroxy groups of the sugar are either protected or activated 
as a phosphoramidite) are provided in Reaction Scheme 6, above. 

In the most preferred embodiments of the invention, the modified bases are 
selected from formulae IVa, IVb or IVc. 




Me 2 N 

DMTrO / \/°vJ 



DMTrO 



-CN 



TV 




P/a 



IVb 
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These compounds are particularly well-suited for use in automatic 
oligonucleotide synthesizers, and in preparing certain modified oligonucleotides described 
herein. 

Other modified bases that are useful in the present invention include those 
5 represented by formulae Va and Vb: 

\=y nh 

O 



10 



15 




DMTrO 



Va 




wherein R 7 is selected from OH, SH or NH 2 . These compounds can be synthesized by 
methods described herein. 

In another group of embodiments, B is 



/11 



N 

O^N 




Preferably, X 1 ' is NH, More preferably, X 11 is NH 2 , Y 1 is O-P 1 , Y 2 is a protected hydroxy, 
Z 1 is H, and R M is selected from 3-hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 4-hydroxy-l- 
butynyl, 4-amino-l-butynyl, 3-(hydroxymethyl)-4-hydroxy-l-butynyl, and 3-methoxyprop-l 
ynyl. In particularly preferred embodiments, Y 1 is -0-[(2-cyanoethyl) N,N- 
diisopropylphosphoramidite] and Y 2 is -0-(4,4 , -dimethoxytrityl). 
In another group of embodiments, B is 

O 



HN 

J- 




,11 



20 



Preferably, Y 1 is O-P 1 , Y 2 is a protected hydroxy, Z 1 is H, and R u is selected from 3- 
hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 4-hydroxy- 1 -butynyl, 4-amino-l-butynyl, 3- 



61 



(hydroxymethyl)-4-hydroxy-l-butynyl, and 3-methoxyprop-l-ynyl. In particularly preferred 
embodiments, Y 1 is -0-[(2-cyanoethyl) N.N-diisopropylphosphoramidite] and Y 2 is - 

0-(4,4 ' -dimethoxytri tyl) . 

In a related aspect, the present invention provides modified oligonucleotides 

having the formula: 



B 




wherein each Z 1 is independently selected from H, F and OR* wherein R a is a member 
selected from H, (C,-C 8 )alkyl and a hydroxy protecting group; each Z 2 is H or (C.-Cg^lkyl, 
or is optionally combined with Z 1 to form a five- to seven-membered ring; each Z 3 is selected 

10 from O, S or NH; each Y is independently selected from P(0)OH, P(S)OH and P(0)CH 3 ; the 
' subscript n is an integer of from 1 to 98; W 1 and W 2 are each independently selected from H, 
a monophosphate, a diphosphate, a triphosphate and a minor groove binder-linking group 
moiety having an optionally attached reporter group or quencher; and each B is 
independently selected from the group consisting of adenine, guanine, cytosine, uridine and 

1 5 modified bases of the formula: 



wherein X" and X 12 are each independently selected from H, NH 2 and a protected amino 
group; each R n is independently selected from protected or unprotected forms of 3- 
hydroxypro P -l-ynyl, 3-aminoprop-l-ynyl, 3-methoxyprop-l-ynyl, 4-hydroxy-l-butynyl, 4- 
20 amino- 1-butynyl, and 3-(hydroxymethyl)-4-hydroxy-l-butynyl; and each R 12 is 

independently selected from the group consisting of protected or unprotected forms of 3- 
hydroxyprop-l-ynyl, 3-aminoprop-l-ynyl, 3-methoxyprop-l-ynyl, 4-hydroxy-l-butynyl, 4- 
amino-l-butynyl,3-(hydroxymethyl)-4-hydroxy-l-butynyl,heterocyclyl(C,-C 12 )alkyl, 
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heterocyclyl(C 2 -C 12 )alkenyl, heterocyclyl(C 2 -C 12 )alkynyl and heterocyclyl, with the provsio 
that R 12 is other than 2-pyridylethynyl; and 

with the further proviso that at least one of the Bs is selected from the 
modified bases, and optionally, one or more of the Bs has an attached minor groove binder- 
linking group moiety, reporter group or a combination thereof. 

Particularly preferred modified bases are those that have been described 

above. 

EXAMPLES 

In the hydrolyzable probe assay, a labeled probe is added to a PCR reaction. 
The probe is complementary to a region between the two PCR primers and is labeled with 
two fluorophores, one of which quenches the fluorescence of the other. The probe is 
designed to hybridize to its complementary target sequence on ope of the PCR product 
strands at or above the strand extension temperature typically used in PCR (55-75°C). The 
polymerizing enzymes normally used in PCR (Tag polymerase in particular) possess an 
intrinsic 5'-exonuclease activity. During synthesis of new strands in the extension stage of 
the PCR reaction, this 5 '-exonuclease activity will act on complementary strands bound to the 
template If a probe, labeled as described above, is bound to the template, the 5 '-exonuclease 
activity associated with the polymerizing enzyme will liberate the bound fluorophore. Once 
liberated, its fluorescence will no longer be quenched, and a fluorescent signal will be 
obtained. See, for example, U.S. Patent No. 5,210,015; Livak et al. (1995) PCR Meth. App. 
4-357-362- and Heid et al. (1996) Genome Res. 6:986-994. 

Thin-layer chromatography was run on silica gel 60 F-254 (EM Reagents) 
aluminum-backed plates. 'H NMR were obtained at 300 MHz on a Varian VXR-300 
spectrometer. Two-dimensional (Cosy) and NOE experiments assisted in the assignment of 
proton resonances. Elemental analyses were performed by Quantitative Technologies Inc. 
(Boundbrook, NJ). 

Example 1 

30 This example illustrates the synthesis of 5-(Prop-2-ynyl-4-methylbenzoate)-5'- 

0- ( 4,4'-dimethoxytriphenylmethyl)-2'-deoxyuridine3'-[(2-cyanoethyl)MiV- 

diisopropylphosphoramidite] ( 3). 
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5-(Prop-2-ynyl-4-methylbenzoate)-2 '-deoxyuridine (1). 

To a mixture of 5 -iodo-2' -deoxyuridine (5.0 g, 14.12 mmol), Cul (270 rag, 
1.42 mmol), Pd(PPh 3 ) 4 (0.82 g, 0.714 mmol), and triethylamine (2.4 mL) in 30 mL of 
anhydrous DMF was added prop-2-ynyl 4-methylbenzoate (6.10 g, 35.06 mmol). The 
mixture was stirred under argon for 4 h. and then evaporated to dryness. The residue was 
triturated in methanol and the excess prop-2-ynyl 4-methylbenzoate that precipitated was 
filtered away. The filtrate was evaporated and the residue was purified by silica gel 
chromatography eluting with 10% methanol in ethyl acetate. The pure product fractions were 
evaporated to dryness and the residue was precipitated from ethyl aceate- ether: 3.14 g (56%) 
yield; TLC (10% methanol in ethyl acetate), R f =0.50; 'H NMR (DMSO-d 6 ) 5 11.68 (1H, s, 
uracil N-H), 8.29 (1H, s, 6-H), 7.89 and 7.35 (4H, 2 x d, J = 8.2 Hz, toluoyl Hs), 6.10 (1H, t, J 
= 6.4 Hz, l'-H), 5.24 (1H, d, J = 4.4 Hz, 3'-OH), 5.15 (2H, s, alkynyl-CH 2 ), 5.12 (1H, t, J = 
5.2 Hz, 5'-OH), 4.24 (1H, m, 3'-H), 3.79 (1H, q, 4'-H), 3.59 (2H, m, 5'-Hs), 2.39 (3H, s, 
toluoyl-CH,), 2.13 (2H, m, 2'-H). 

5-(Prop-2-ynyl-4-methylbenzoate)-5 '-0-(4,4 •-dimethoxytriphenylmethyl)-2 '-deoxyuridine 
(2). 

To a solution of 1 (3.0 g, 7.50 mmol) in 45 mL of anhydrous pyridine was 
added dimethoxytrityl chloride (3.0 g). The resulting solution was stirred for 4 h. at room 
temperature and then poured into 400 mL of 5% sodium bicarbonate solution. The mixture 
was extracted with ethyl acetate (2 x 300 mL) and the extract was dried over sodium sulfate, 
filtered and evaporated. The residue was purified by silica gel chromatography eluting with 
5% methanol in the ethyl acetate. The pure product fractions were pooled and evaporated 
affording 2 as a foam: 4.16 g (79%) yield 

5-(Prop-2-ynyl-4-methylbenzoate)-5 '-0-(4,4 '-dimethoxytriphenylmethyl)-2 '-deoxyuridine 3 '- 
[(2-cyanoethyl) N,N-diisopropylphosphoramidite] (3). 

To a solution of 2 (4.0 g, 5.70 mmol) in 130 mL of anhydrous methylene 
chloride, containing 3.0 mL of W-diisopropylethylamine was added chloro(2- 
cyanoethoxy)(W-diisopropylamino)phosphine (2.22 mL) under argon. The solution was 
stirred for 30 min at room temperature and then treated with 3 .0 mL of methanol. The 
solution was diluted with 300 mL of ethyl acetate and washed with 300 mL of 5% sodium 
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bicarbonate solution. The aqueous phase was extracted with 300 mL of ethyl acetate and the 
combined organic phases were dried over sodium sulfate, filtered and evaporated. The 
residue was purified by silica gel chromatography eluting with a gradient of 30 to 0% hexane 
in ethyl acetate (2% triethylamine). The pure fractions were pooled and evaporated and the 
residue was precipitated from ethyl acetate - hexanes: 3.35 g (65%) yield; TLC (20% hexane 
in ethyl acetate), R f = 0.82 and 0.71 (diasteromers); 3, P NMR (DMSO-d 6 ) 5 147.82 and 
147.45. 

Example 2 

Phase 1 Preparation of (2R,5R)-5-(4-amino-3-iodopyrazolo[3,4-dJpyrimidinyl)-2- 
(hydroxymethyl)oxolan-3-ol (9). 

3-Iodo-l,5-dihydropyrazolo[3,4-d]pyrimidin-4-one (4). 

x 5 The synthesis of this compound was previously reported by Taylor et. al 

{Tetrahedron, 48(37):8089-8100 (1992)) using a N-iodosuccinimide. Our synthesis, 
employing iodine momochloride as the iodinating agent is described below. 

To a 1.0 L solution of 1.2 M sodium acetate was added 4- 
hydroxypyrazolo[3,4-d]pyrimidine (25 g, 184 ramol) followed by iodine monochloride (46 g, 

20 284 mmol). The mixture was stirred in a sealed container for 4 h. at 1 1 0 °C. The completed 
reaction was cooled to room temperature and treated with a solution of 30 g of sodium 
metabisulfite in 200 mL of water. The white precipitate that formed was filtered and rinsed 
with cold water. The solid was then dissolved in 200 mL of 3.2 M potassium hydroxide 
solution. Hydrazine monohydrate (5 mL) was added and the solution was stirred for 15 min. 

25 The solution was then acidified to ca. pH 7 by addition of 50 mL of concentrated HC1 

followed by a fine pH adjustment with acetic acid. The solid that formed was filtered, rinsed 
with cold water and dried: 55.1 g (1 14% yield). Note- it is difficult to remove residual water 
by drying the solid under vacuum. The residual water will be removed by a pyridine 
evaporation process in the next step. 
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3-Iodo-l,5-dihydropyrazolo[3,4-d]pyrimidin-4-thione (5). 

Compound 4 (21.5 g, 82.1 mmol) was suspended in 150 mL of anhydrous 
pyridine and the mixture was evaporated to dryness. The solid was re-suspended in 170 mL 
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of dry pyridine (under argon) and P 2 S 5 (26.8 g) was added. The mixture was stirred for 10 
min at 65 °C and then for 2-6 hours at 95 °C. The resulting solution was cooled to room 
temperature and degassed by bubbling a stream of argon into the solution (sulfide gas is 
passed into a trap containing sodium hydroxide solution). The reaction solution was then 
5 reduced in volume until a thick syrup formed. Excess P 2 S 5 was decomposed by addition of 1 
mL aliquots of water, until the vigorous reaction ceased, followed by addition of 500 mL of 
water and 10 mL of acetic acid. The mixture was warmed to 70 °C for 1 h to expedite 
expulsion of hydrogen sulfide gas and then diluted with 500 mL of water and cooled in an 
ice-bath. The solid was filtered, washed with water and dryed: 19.8 g (87%) yield. 

0 

4-Ethylthio-3-iodopyrazolo[3,4-d]pyrimidine (6). 

Compound 5 (43.5 g, 1 57 mmol) was stirred in potassium hydroxide solution 
(38.6 g KOH in 350 mL of water) for 30 min. The mixture was filtered and the filtrate was 
acidified to pH 10 by addition of acetic acid and then diluted with 350 mL of absolute 
5 ethanol. Iodoethane (10 mL) was added and the solution was stirred at room temperature. 
Additional 10 mL aliquots of iodoethane were added after 30 min. and 1.0 h. The reaction 
was complete after a total of 90 min. Other runs required further addition of iodoethane to 
complete the reaction. The reaction solution was diluted with 700 mL of water and 20 mL of 
acetic acid. The mixture was cooled in an ice-bath and the solid was filtered, rinsed with 
20 water and dried. This crude product was dissolved in 300 of DMF and stirred at 90 °C for 15 
min. Insoluble material was filtered away and the filtrate was diluted with 1 L of water and 
cooled in an ice-water bath. The solid was filtered, rinsed with water and dried: 19 g (40%) 
yield. 

25 [(2R,5-R)-5-(4-Ethylthio-3-iodopyrazolo[3.4-d]pyrimidinyl)-3-(4- 

methylphenylcarbonyloxy)oxolan-2-yl]methyl 4-methylbenzonate (7). 

Compound 6 was converted to the corresponding potassium salt by reaction 
with 1 molar equivalent of potassium hydroxide in water. The resulting solution was 
evaporated to dryness and the residue was evaporated from dry acetonitrile. The potassium 

30 salt of 6 (29.0 g, 94.73 mmol) was dissolved in 80 mL of anhydrous DMF and then diluted 
with 830 mL of anhydrous aceton.tnle. The chlorosugar derivative (48 g, 123 mmol) was 
added in one portion and the mixture was stirred for 3 h and then diluted with 1.5 L of 25% 
aqueous methanol. The mixture was allow to sit at 5 °C overnight. The crystals were 
filtered, rinsed with 25% aqueous methanol and dried: 32.2 g (52%) yield. 
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(2RM)-2-(hydroxymethyl)-5-(3-iodo-4-methylpyrazolo[3,4-d]pyrim 

To a suspension of 7 (10.97 g, 16.66 mmol) in 250 mL of methanol was added 
22 mL of IN sodium methoxide in methanol. The mixture was stirred at reflux and the 
reaction progress was monitored as soon as the mixture turned to a clear solution Note- 
formation of side products are observed if refluxing is continued after complete conversion to 
5. The reaction was quenched by addition of 1.34 mL of acetic acid as soon as one 
component, corresponding to the desired product, is observed by TLC. The solution was 
evaporated and the residue was precipitated from methanol-ether-hexane. The solid was 
filtered and dried: 6.8 g (104%) yield crude product (contains sodium acetate). 

(2RM)-5-(4-amino-3-iodopyrazolo[3,4-d]pyrimidiny^ 

Compound 8 (6.8 g, 17.35 mmol) was stirred in 200 mL of concentrated 
ammonium hydroxide (sealed container) at room temperature for 36 h. The mixture was 
evaporated and the residue was precipitated from acetonitrile-ether. The solid was filtered 
and dried: 5.36 g (82%) yield. NMR and spectral data on this compound is identical to 

Phase 2 Preparation of 3-{ 4-(( lE)-l-aza-2-methylprop-l -enyl)-l-((2R, 5R)-5-{[Bis(4- 
methoxyphenyl)phenylmethoxy]methyl}-4-{[bis(methylethyl)amino](2- 
cyanoethoxy)phosphinooxy}oxolan-2-yl)pyrazolo[3A-d]pyr^ 4- 
methoxybenzoate (13; Rl =-OCOPhCH 3 ) 

3-{l-{(2RM)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-4-aminopy 

3-yl}prop-2-ynyl} 4-methylbenzoate C10; R 1 = -OCOPhCH 3 ) 

To a mixture of 9 (2.40 g, 6.37 mmol), Cul (124 mg, 0.648 mmol), Pd(PPh 3 ) 4 
(380 mg, 0.331 mmol), and triethylamine (1.32 mL) in 12 mL of anhydrous DMF was added 
prop-2-ynyl 4-methylbenzoate (1 .87 g, 1 1 .85 mmol). The mixture was stirred under argon 
for 12 h. and then evaporated to dryness. The residue was purified by silica gel 
chromatography eluting with 5% methanol in ethyl acetate. The pure product fractions were 
evaporated affording a foam: 2.29 g (85%) yield; TLC (10% methanol in ethyl acetate), R f = 
0.43; 'H NMR (DMSO-d 6 ) 5 8.26 (1H, s, 6-H), 7.92 and 7.37 (4H, 2 x d, J = 8.5 Hz, 
toluoyl-Hs), 6.55 (1H, t, J = 6.3 Hz, l'-H), 5.29 (2H, s, alkynl-CH 2 ), 5.28 (1H, d, J = 4.7 Hz, 
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3'-OH), 4.76 (1H, t, J = 5.7 Hz, 5'-OH), 4.41 (1H, m, 3'-H), 3.81 (1H, m, 4'-H), 3.49 and 
3.56 (2H, 2 x m, 5'-Hs), 2.76 and 2.24 (2H, 2xm, 2'-Hs), 2.39 (3H, s, toluoyl- CH 3 ). 



3-{4-((lE)-l-aza-2-methylprop-l-enyl)-l-((2R,5R)~4-hydroxy-^ 
5 yl]pyrazolo[3,4-d]pyrimidin-3-yl}prop-2-ynyl 4-methylbenzoate (11; R 1 = -OCOPhCH 3 ). 

Compound 10 (1.76 g, 4.16 mmol) was stirred in a solution of 5.0 mL of N,N- 
dimethylacetamide, 1.9 mL of N-dimethylacetamide dimethyl acetal and 2.0 mL of 
anhydrous triethylamine for 15 h. at room temperature. The solvents were evaporated and the 
residue was evaporated twice from xylenes affording 11 as a foam. TLC (10% methanol in 

10 ethyl acetate), R f = 0.29; l H NMR (DMSO-d 6 ) 5 8.51 (1H, s, 6-H), 7.90 and 7.37 (4H, 2 x 
d, J = 8.3 Hz, toluoyl protons), 7.28, 7.14 and 6.74 (13H, d and 2 m, J = 7.1 Hz for the 
doublet), trityl protons), 6.63 (1H, m, l'-H), 5.32 (1H, d, J = 5.0 Hz, 3'-OH), 5.24 (2H, s, 
alkynl-CH 2 ), 4.56 (1H, m, 3'-H), 3.93 (1H, m, 4'-H), 3.68 (6H, s, methoxy Hs), 3.32 and 3.1 1 
(6H, 2 x s, N-methyls), 3.09-2.95 (2H, m, 5'-Hs), 2.81 and 2.32 (2H, 2 x m, 2'-Hs), 2.40 (3H, 

15 s, = - Me), 2.19 (3H, s, toluoyl-Me). 

3-{4-((lE)-l-aza-2-methylprop-l-enyl)-l-((2R,5R)-5-{[Bis(4^ 
methoxyphenyl)phenylmethoxy] methyl} -4-hydroxyo 
ylJprop-2-ynyl 4-methylbenzoate (12; R 1 =-OCOPhCH 3 ) 

20 To a solution of 1 1 in 25 mL of anhydrous pyridine was added dimethoxytrityl 

chloride (1 .67 g). The resulting solution was stirred for 4 h. at room temperature and then 
poured into 250 mL of 5% sodium bicarbonate solution. The mixture was extracted with 
ethyl acetate and the extract was dried over sodium sulfate, filtered and evaporated. The 
residue was purified by silica gel chromatography eluting with 5% methanol in the ethyl 

25 acetate. The pure product fractions were pooled and evaporated affording 12 as a foam: 2.06 
g (62%) yield for the two-step process. 

3-{4-((lE)'l-aza-2-methylprop'l'enyl)-l-((2R t 5R)-5-{[Bis(4^ 
methoxyphenyl)phenylmethoxy]methyl}-4-{[bis(meth^^ 
30 cyanoethoxy)phosphinooxy}oxolan-2-yl)pyrazolo[3A-d]pyrim^ 4- 
methoxybenzoate (13; R l = -OCOPhCH s ) 
or, 
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l-[2-Deoxy-5-0-(4.4'-dimethoxytriphenylmethyl)-J^D-erythro-pentofuranosyl]-4- 
[(dimethylamino)prop-l-enyl]-3-(prop-2-ynyl-4-methylbenzoate) lH-pyrazolo[3,4- 
djpyrimidine 3 '-[(2-cyanoethyl) N,N-diisopropylphosphoramidite] (13). 

To a solution of 12(2.03 g, 2.56 mmol) in 60 mL of anhydrous methylene 
chloride, containing 1.33 mL of M^-diisopropylethylamine was added chloro(2- 
cyanoethoxy)(N,W-diisopropylamino)phosphine (1.0 mL) under argon. The solution was 
stirred for 1 .0 h at room temperature and then treated with 2.0 mL of methanol. The solution 
was diluted with 250 mL of ethyl acetate and washed with 200 mL of 5% sodium 
bicarbonate solution. The organic phase was dried over sodium sulfate, filtered and 
evaporated. The residue was purified by silica gel chromatography eluting with 2% 
triethylamine in ethyl acetate. The pure fractions were pooled and evaporated. The 
phosphoramidite (13) was precipitated from ether-hexanes: 1 .82 g (71%) yield; TLC (5 % 
methanol in ethyl acetate), R f = 0.32; 3, P NMR (DMSO-d 6 ) 6 147.90 and 147.22. 



Example 3 

This example illustrates the preparation of N-{3-[l-((2R,5R)-5 {[bis(4- 
methoxyphenyl)phenylmethoxy]methyl}-4-{[bis(methylethyl)amino](2- 
cyanoethoxy)phosphinooxy}oxolan-2-yl)-6-amino-4-oxo(5-hydropyrazolo[3,4-d]pyrimidin- 

3-yl)]propyl}-2,2,2-trifluoroacetamide (22) 

3-Iodo-4-methoxypyrazolo[3,4-d]pyrimidine-6-ylamine(14). 

4-Methoxy-pyrazolo[3,4-cTJpyrimidin-6-ylamine (6.75 g, 40.87 mmol) was 
suspended in an aqueous solution containing sodium acetate (6.0 g, 44.09 mmol) and iodine 
monochloride (9.12 g, 56.17 mmol) and stirred at 100 °C in a sealed reaction vessel for 24 h. 
The resulting mixture was cooled to room temperature and treated with a solution of sodium 
metabisulfite (3.6 g, 18.94 mmol) in 24 mL of water. The solid that formed was filtered, 
rinsed with water and dried: 6.93 g (58%) yield; TLC (10% methanol in ethyl acetate), R f = 
0.57; 'H NMR (DMSO-d 6 ) 6 13.08 (lH.br s, N-l proton), 10.58 (1H, s, N-5 proton), 6.60 
(2H, br s, 6-amino). 

[(2R.5R)-5-(6-Amino-3-Iodo-4-methoxypyrazolo[3,4-d]pyrimidinyl)-3-(4- 
methylphenylcarbonyloxy)oxolan-2-yl]methyl 4-methylbenzoate (1 5). 
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To a suspension of (14) (6.68 g, 22.95 mmol) in 150 mL of methanol was added 8.05 
mL of a 2.85 M methanolic potassium hydroxide. The mixture was stirred for one minute 
and then diluted with 100 mL of toluene and evaporated. The solid potassium salt of 1 was 
dried under vacuum. The potassium salt was suspended in 75 mL of anhydrous DMF and 

5 then diluted with 420 mL of anhydrous acetonitnle. l-Chloro-l,2-dideoxy-3,5-di-0- 

toluoylribofuranose (8.95 g, 22.95 mmol) was immediately added and the reaction mixture 
was stirred at room temperature under argon for 1 h. and then filtered. The filtrate was 
evaporated and the residual foam was recrystallized twice from methanol: 6.59 g (45%) 
yield; TLC (1:1. ethyl acetate/hexane), R f = 0.52; l H NMR (DMSO-d 6 ) 5 7.93 and 7.37 (8H, 

10 m and d J = 7.9 Hz for the d, aromatic), 7.10 (2H, br s, 6-amino), 6.51 (1H, t, J = 6.6 Hz, 1>- 
H), 5.69 (1H, m. 3>-H), 4.44 (3H, m, 4'- and 5'-Hs), 3.99 (3H, s, 4-methoxy), 3.19 and 2.63 
(2H, 2 x m, 2*-Hs), 2.40 and 2.38 (6H, 2 x s, toluoyl methyl protons). 

[(2R,5R)-5-(6-Amino-3-Iodo-4-methoxypyrazolo[3A-d]pyrimid^ 

1 5 (hydroxymethyl)oxolanS-ol (1 6). 

To a suspension of 15 (32.4 g, 50.40 mmol) in 600 mL of methanol was added f2.5 
m L of 1M sodium methoxide in methanol. The reaetion mixture was stirred at reflux for 1 8 
h and then cooled in a freezer (-10 °C) . The crystals of 3 that formed were filtered and 
rinsed with ice-co.d methanol: 10.95 g yield. Acetic acid (12.5 mL) was added to the fihra.e 
and the volume was reduced to about 300 mL. The solution was allowed to si. m the freezer 
overnight and another crop of crystals (3.95 g) was collected. The filtrate was evaporated to 
dryness and .he residual oil was tritura.ed in e.her. The solid .ha. formed was filtered, dned 
and recrys.a,lized from boding water: 2.03 g. To.a, yie,d - 16.93 g (83%); TLC (10% 
methanol in ethyl ace.a.e), R r - 0.23: 'H NMR (DMSO-d 6 ) 8 7.02 (2H, br s. 6-amino), 6.33 
(1H t, J =6.1 Hz), 5.24 (,H, d, J - 4.4 Hz, 3'-OH), 4.73 (1H, ,, J - 5.6 Hz, 5--OH), 4.36 (1H, 
" m 3'-H), 3.99 (3H, s, 4-me.hoxy), 3.75 (1H, m, 4'-H), 3.51-3.32 (2H, m, 5'-Hs), 2.79 and 
2.17 (2H, 2 X m, 2'-Hs). Anal. Calcd. For C„H, 4 fN s 0 4 ■ 0.3 H 2 0; C, 32.02; H, 3.57; N, 
16.98. Found C, 32.13; H, 3.35; N, 16.77. 

30 /-ff2*.J/!Ha-/.y*^^ 
d]pyrimidin-4-one (17). 

A suspension of (16) (16.84 g, 41.36 mmol) in 1.5 L of 1 N sodium hydroxide 
solution was heated to reflux. The mixture turned into a homogenous solution and was 
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cooled to 25 °C using an ice-water bath. Acetic acid (90 mL) was added and the solution was 
stored at 5 °C overnight. The crystals that formed were filtered, rinsed with ice-cold water 
and dried: 7.47 g yield of 4. The filtrate was evaporated to a volume of about 500 mL and 
• stored at 5 °C overnight. A second crop of crystals was collected (3.75 g). The filtrate was 
5 evaporated to about 200 mL. The sodium acetate that formed was filtered off and the filtrate 
was stored at 5 °C overnight. Another crop of crystal was collected (2.25 g). Total yield: 
13.47 g (83 %); TLC (5:3:2, n-butanol/water/acetic acid), R f = 0.74; >H NMR (DMSO-d 6 ) 5 
6.22 (1H, t, J = 6.3 Hz, 1 '-H), 4.33 (1H, m, 3'-H), 3.73 (1H, m, 4>-H), 3.47 and 3.36 (2H, 2 x 
m, 5'-Hs), 2.63 and 2.12 (2H, 2 x m, 2'-Hs). Anal. Calcd. For C 10 H l2 IN 5 O 4 ■ 0.7 H 2 0; C, 
10 29.60; H, 3.33; N, 17.26. Found C, 29.81; H, 3.02; N, 17.00. 

N-(3.{l-[(2R,5R)-4-(hydroxymethyl)oxalan-2-yl]-6^mm 
d]pyrimidin-3-yl)}prop-2-ynyl)-2,2Mrifluoroacetamide(18). 

To a mixture of 17 (6.00 g, 15.26 mmol), Cul (297 mg, 1.56 mmol) and 
tetrakis[tri P hen y lphos P hine] P alladium[0] in 30 mL of anhydrous DMF was added anhydrous 
triethylamine (3.14 mL) followed by propargyl trifluoroacetimide (4.29 g, 28.41 mmol). The 
reaction solution was stirred under argon for 40 h. The DMF was evaporated off and the 
residual oil was triturated in chloroform. The solid of crude 5 that formed was filtered, rinsed 
with chloroform and dned. The solid was dissolved in a minimum volume of DMF, absorbed 
onto silica gel and evaporated. The dry mixture was loaded onto a silica gel column and 
eluted with 10% methanol in ethyl acetate. The product fractions were pooled and 
evaporated. The residue was precipitated from ethyl acetate/ether: 4.0 g (63 %) yield; TLC 
(200/c methanol in ethyl acetate), R f = 0.59; 'H NMR (DMSO-d 6 ) 5 10.79 (1H, s, N 5 -H), 
10 16 (1H t, J = 5.2 Hz, trifluoroacetimido N-H), 6.77 (2H, br s, 6-amino), 6.28 (1H, t, J - 
25 6 3 Hz, l'-H), 5.23 (1H, d, J - 4.1 Hz, 3'-OH), 4.72 (1H, t, J = 5.1 Hz, S'-OH), 4.32 (3H, m, 
-CONH-Ofe.- and 3'-H), 3.75 (1H, m, 4*-H), 3.50-3.29 (2H, 2 x m, S'-Hs). 2.65 and 2.15 
(2H, 2 x m, 2'-Hs). Anal. Calcd. For C 15 H 15 F3N 6 0 5 ■ 0.74 H z O; C, 41.93; H, 3.87; N, 19.56. 
Found C, 42.33; H, 3.64; N, 19.13. 

30 N .( 3 .{l-[(2R.5R)-4-(hydroxymethyl)oxalan-2-yl]-6-amino-4^ 
d]pyriminin-3-yl)}propyl)-2.2.2-trifluoroacetamide(19). 

To a solution of (18) (1.0 g, 2.40 mmol) in 100 mL of methanol, containing 0.12 g of 
20% palladium hydroxide (preactivated with formic acid), was added 2.0 mL of 4 M 
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triethylammonium formate buffer (pH 6.5). The mixture was shaken under 35 psi of 
hydrogen gas for 18 h (during some runs it was necessary to add additional catalyst for 
complete reduction). The mixture was filtered through Celite and the filtrate was evaporated. 
The residual oil was crystallized from water: 0.79 g (78%) yield; TLC (20% methanol in 
ethyl acetate), R, = 0.52; 'H NMR (DMSO-d 6 ) 5 10.59 (1H, s, N 5 -H), 9.47 (1H, br t, 
trifluoroacetimido N-H), 6.64 (2H, br s, 6-amino), 6.27 (1H, t, J = 6.3 Hz, l'-H), 5.18 (1H, d, 
j = 4.4 Hz, 3'-OH), 4.75 (1H, t, J = 5.9 Hz, 5'-OH), 4.36 (1H, m, 3'-H), 3.75 (1H, m, 4'-H), 
3.48 and 3.61 (2H, 2 x m, 5'-Hs), 3.22, 2.68 and 1.87 (6H, 3 x m, propyl methylene protons), 
2.68 and 2.12 (2H, 2 x m, 2'-Hs). Anal. Calcd. For C5H19F3N6O5 ' 0.90 H 2 0; C, 41.27; H, 
4.80; N, 19.25. Found C, 41 .57; H, 4.50; N, 19.11. 

H-(3-[(lE)-l-aza-2-(dimethylamino)prop-l-enyl]-[^ 

amino-4-oxo(5-hydropyrazolo[5-4-d]pyriminin-3-yl)}propyl^ 

To a solution of (19) (0.80 g, 1.98 mmol) in 5.0 mL of anhydrous DMF was added 

N,N-dimethylformamide dimethylacetal (3.1 mL). The solution was stirred for 2.0 h. under 
argon and then evaporated. The residue was evaporated twice from xylenes and then placed 
under vacuum. The amorphous solid (20) that formed was triturated with ether and collected: 
773-mg (82%) yield; TLC (20% methanol in ethyl acetate), R f = 0.47; 'H NMR (DMSO-d 6 ) 
8 1 1.22 (1H, br s, N 5 -H), 9.47 (1H, t, J = 5.5 Hz, trifluoroacetimido N-H), 8.67 (1H, s, 
N=CH-N), 6.42 (1H, t, J = 6.5 Hz, l'-H), 5.22 (1H, d, J = 4.3 Hz, 3'-OH), 4.75 (1H, t, J = 6.1 
Hz, S'-OH), 4.40 (1H, m, 3'-H), 3.77 (1H, m, 4'-H), 3.50 and 3.38 (2H, 2 x s, 5'-Hs), 3.18 
and 3.05 (6H, 2 x s, N,N-dimethyl Hs), 3.22, 2.72 and 1.89 (6H, 3 x m, propyl methylene 
Hs), 2.72 and 2.15 (2H, 2 x m, 2'-Hs). Anal. Calcd. For C I8 H 2 4F 3 N 7 0 5 ' 0.40 H 2 0; C, 44.80; 
H, 5.18; N, 20.32. Found C, 45.02; H, 4.96; N, 19.94. 

N .{3.[l.((2R.SR)-5-{[bis(4-methoxyphenyl)phenylmeth^ 
amino-4-oxo(5-hydropyrazolo[3A-d]pyrin^^ 

To a solution of (20) (723-mg, 1 .52 mmol) in 9.0 mL of anhydrous pyridine was 
added 4,4'-dimethoxytntyl chloride (0.61 g, 1-80 mmol). The reaction solution was stirred 
for 3.0 h. under argon and then poured into 100 mL of 5% sodium bicarbonate solution. The 
aqueous solution was extracted with ethyl acetate (2 x 200 mL) and the dned extracts 
(sodium sulfate) were evaporated. The crude product was purified by silica gel 
chromatography eluting with a solvent gradient of 0-5% methanol in ethyl acetate (2% 
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triethylamine). The product fractions were evaporated affording 21 as an amorphous solid: 
724-mg (61%) yield; TLC (5% methanol in ethyl acetate), R f = 0.39; l H NMR (DMSO-de) 6 
1 1.27 (1H, s, N 5 -H), 9.43 (1H, t, J = 5.3 Hz, trifluoroacetimido N-H), 8.71 (1H, s, N=CH-N), 
7.32, 7.17 and 6.76 (13H, 3 x m, aromatic), 6.45 (1H, t, J = 6.3 Hz, l'-H), 5.26 (1H, d, J = 5.3 
Hz, 3'-OH), 4.45 (1H, m, 3'-H), 3.90 (1H, m, 4»-H), 3.70 (6H, s, OMe Hs), 3.18 and 3.05 
(10H, 2 x s, N,N-dimethyl, 5'-Hs and CONH-Ctf 2 ), 2.62 and 1.65 (4H, 2 x m, methylene Hs), 
2.62 and 2.20 (2H, 2 x m, 2'-Hs). Anal. Calcd. For C39H42F3N7O7 ' 0.30 H 2 0; C, 59.81; H, 
5.48; N, 12.52. Found C, 59.80; H, 5.39; N, 12.63. 

N-{3-[l-((2R,5R)-5{[bis(4-methoxyphenyl)phenylmethoxy]methyl}-4- 

{[bis(methylethyl)amino](2-cyanoethoxy)phosphinooxy}oxo^ 
hydropyrazolo[3A-d]pyrimidin-3-yl)]propyl}-2X2-triJluoroacetamid^ 

To a solution of (21) (700-mg, 0.900 mmol) in 22 mL of anhydrous methylene 
chloride, containing 0.47 mL of diisopropylethylamine, was added 2-cyanoethyl N,N- 
diisopropylchlorophosphoramidite (0.34 mL, 1.52 mmol). After stirring for 30 minutes under 
argon at 25 °C the solution was treated with 3.0 mL of methanol and diluted with 200 mL of 
ethyl acetate. The solution was washed with 100 mL of 5% sodium bicarbonate solution and 
dried over sodium sulfate and evaporated. The crude product was purified by silica gel 
chromatography eluting with 2% triethylamine in ethyl acetate. The product fractions were 
evaporated and the residue was precipitated from ether - hexane: 583-mg (66%) yield; TLC 
(ethyl acetate), R r = 0.38; 31 P NMR (DMSO-d 6 , reference to 85% phosphoric acid) 5 145.50 
and 144.72. 

Example 4 

This example illustrates the preparation of 3-{[5-(4,6-bis{(lE)-l-aza-2-[bis(2- 
methylpropyl)amino] vinyl} -3-prop-l -ynylpyrazolo[3,4-d]pyrimidinyl)-2-{[bis(4- 
methoxyphenyl)phenylmethoxy]-methyl}oxolan-3-yloxy][ethyl(methylethyl)amino}- 

phosphinojpropanenitrile (26) 

5.(4 t 6-bi S {(lE)-l-aza-2-[bis(2-methylpropyl)amino]vinyl}-3-prop-l^ 

d]pyrimidinyl)-2 (hydroxymethyl)oxolan-3-ol (24) 

Compound (23) (1 mmol) can be stirred for 5 hr at room temperature with 
(dimethoxymethyl)bis(2-methylpropyl)amine (0.5 mL, 3.37 mmol). The reaction mixture can 



be reduced in vacuo, and the protected compound can be purified on silica gel to yield (24) 
(Vincent et al). 

5.(4,6-bis{(lE)-l-aza-2-[bis(2-methylpropyl)aminoJvinyl}-3-prop^ 
d]pyrimidinyl)-2-{[bis(4-methoxyphenyl)phenylmethoxy]^ (25) 

To a solution of (24) (1.50 mmol) in 9.0 mL of anhydrous pyridine 4,4'- 
dimethoxytrityl chloride (0.61 g, 1 .80 mmol) can be added. The reaction solution can be 
stirred for 3.0 h. under argon and then can be poured into 100 mL of 5% sodium bicarbonate 
solution. The aqueous solution can be extracted with ethyl acetate (2 x 200 mL) and the 
extracts can be dried(sodium sulfate) and evaporated. The crude product can be purified by 
silica gel chromatography with an appropriate gradient to yield (25). 

3 . { [ 5 .(4,6-bis{(lE)-l-aza-2-[bis(2-methylpropyl)amino]v^ 
d]pyrimidinyl)-2-{[bis(4-methoxyphenyl)phenylmethoxy]methy^ 
yloxy][ethyl(methylethyl)amino}phosphino}propanenitrile (26) 

To a solution of (25) ( 0.900 mmol) in 22 mL of anhydrous methylene 
chloride, containing 0.47 mL of diisopropylethylamine, 2-cyanoethyl N,N- 
diisopropylchlorophosphoramidite (0.34 mL, 1 .52 mmol) can be added. The solution can be 
stirred for 30 minutes under argon at 25 °C and the can be treated with 3.0 mL of methanol 
and diluted with 200 mL of ethyl acetate. The solution can then be washed with 100 mL of 
5% sodium bicarbonate solution and dried over sodium sulfate and evaporated. The crude 
product can be purified by silica gel chromatography with an appropriate gradient to yield 
(26). 

Example 5 

This example illustrates the synthesis of 5-[4,6-diamino-3-(2- 
methoxyethynyl)pyrazolo[3,4-d]pyrimidinyl]-2-(hydroxymethyl)oxolan-3-ol (27). 

5-[4.6-dia m ino-3-(2-methoxyethynyl)py^ 
3-ol (27) 

To a mixture of 4,6-diamino-l-(2-deoxy-P-D- e r ) ;// I ro-pentofuranosyl)-3-iodo- 
l//-pyrazolo[3,4-d]pyrimid,ne (4.2 g, 10.71 mmol), Cul (21 1 mg, 1.10 mmol), Pd(PPh 3 ) 4 
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(635 mg, 0.553 mmol), and triethylamine (2.2 mL) in 20 mL of anhydrous DMF was added 
methyl propargyl ether (1.82 mL). The mixture was stirred under argon for 16 h. and then 
evaporated to dryness. The residue was crystallized from methanol: 3.20 g (89%) yield; 
TLC (30% methanol in ethyl acetate), R r = 0.63; *H NMR (DMSO-d 6 ) 5 7.44 (2H, br d, 
amino), 6.32 (1H, t, J = 6.6 Hz, 1 '-H), 6.33 (2H, br s, amino), 5.20 (1H, d, J = 4.1 Hz, 3*- 
OH), 4.75 (1H, br t, 5'-OH), 4.40 (2H, s, methylene), 4.36 (1H, m, 3'-H), 3.76 (1H, m, 4'-H), 
3.47 & 3.32 (2H, 2 x m, 5*-Hs), 3.32 (3H, s, methoxy), 2.68 & 2.14 (2H, 2 x m, 2'-Hs). 

Synthesis of5-{6-[(l E) - 1 -aza-2-(dimethylamino) vinyl] -4- [ (1Z)-1 -aza-2- 
(dimethylamino)vinyl]pyrazolo[3A-d]pyrimidinyl}-2-(hydroxyme 1- 

methoxyprop-l-yne (28). 

Compound 27 (3.1 g, 9.28 mmol) was stirred in a solution of 30 mL of N,N- 
dimethylformamide and 15 mL of A^-dimethylformamide dimethyl acetal for 14 h at 45 °C. 
The solvents were evaporated and the residue was evaporated twice from xylenes affording 
28 as a foam which was precipitated from ethyl acetate - ether: 2.8 g (68%) yield; TLC (50% 
methanol in ethyl acetate), R f = 0.36. 

Synthesis of 3-[(5-{6-[(lE)-l-aza-2-(dimethylamino)vinyl]-4-[(lZ)-l-aza-2- 

(dimethylamino)vinyl]pyrazolo[3J-d]pyrimidinyl}-2-{[bis(4-methoxyph^ 

methoxy]methyl}oxolan-3-yloxy)[bis(methylethyl)amino]pho 

To a solution of 28 (2.7 g, 6.08 mmol) in 45 mL of anhydrous pyridine was 
added dimethoxytrityl chloride (2.4 g). The resulting solution was stirred for 3 h. at room 
temperature and then poured into 200 mL of 5% sodium bicarbonate solution. The mixture 
was extracted with ethyl acetate and the extract was dried over sodium sulfate, filtered and 
evaporated. The residue was purified by silica gel chromatography eluting with a gradient of 
0-40% methanol in ethyl acetate. The pure 5'-C-DMT derivative fractions were pooled and 
evaporated affording a foam: 1.0 g (22%) yield. 

To a solution of DMT derivative (0.98 g, 1.31 mmol) in 16 mL of anhydrous 
methylene chloride, containing 0.70 mL of N.W-diisopropylethylamine was added 2- 
cyanoethyl diisopropylchlorophosphoramidite (0.50 mL) under argon. The solution was 
stirred for 30 min at room temperature and then treated with 1.0 mL of methanol. The 
solution was loaded directly on to a silica gel column and eluted with a gradient of 0-20% 
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methanol in ethyl acetate (2% triethylamine). The pure fractions were pooled and evaporated 
affording a foam: 0.25 g (20%) yield. 



Example 6 



This example illustrates the preparation of 3-[(5-{4-[(lZ)-l-aza-2- 
(methylamino)vinyl]-3-(2-furyl)pyrazolo[3,4-d]pyrimidinyl}-2-{[bis(4- 
methoxyphenyl)phenylmethoxy]methyl}oxolan-3-yloxy)[bis(methylethyl)amino]- 

phosphinooxy]propanenitrile (35) (See Reaction Scheme 8) 

(2-Furylmethoxymethylene)methane-l,l-dicarbonitrile (30) 

To an ice-cold solution containing anhydrous methylene chloride (500 mL), 
triethylamine (100 mL), and malonitrile (30g, 454 mmol) was added 2-furanyl chloride (50 g, 
383 mmol) by dropwise addition over a 20 min period. Stirring was then continued for an 
additional hour at ambient temperature. The reaction solution was washed with 1.5 L of ice- 
cold 2N HC1 solution and then with 1.5 L of water. The organic phase was evaporated 
affording an oil. 

A portion of the product oil (4.6 g, 28.75 mmol) was dissolved in a solution 
consisting of 40 mL of dioxane and 4.0 mL of water. Dimethyl sulfate (15 mL) and sodium 
bicarbonate (15g) were added and the reaction solution was stirred for 2.5 h at 80 °C. The 
mixture was then dissolved in 100 mL of water and the product was extracted with 200 mL of 
ethyl acetate. The organic solution was washed with 100 mL of 5% sodium bicarbonate 
solution followed by 100 mL of water. The solution was dried over sodium sulfate, filtered 
and evaporated affording an oil which solidified under vacuum: 2.72 g yield; TLC (1:1, 
ethyl acetate/hexane), R r = 0.42; *H NMR ((DMSO-ek) 5 8.25 (1H, m, aromatic), 7.63 (1H, 
m, aromatic), 6.92 (1H, m, aromatic), 4.23 (3H, s, methoxy). 

2-Amino-4-(2-furyl)furan-3-carbonithle (31) . 

Hydrazine monohydrate (3.4 mL, 80 mmol) was added dropwise to an ice- 
cold solution of 30 (2.72 g, 15.63 mmol) in 75 mL of methanol over a penod of 15 min. The 
solution was then evaporated to dryness affording an oil which solidified under vacuum. The 
solid was triturated in ether, filtered and dried: 2.2 g (81%) yield; TLC (ethyl actate), R f = 
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0.81; 'H NMR (DMSO-rf d ) 5 7.77 (1H, s, furanyl), 6.80 (1H, m, furanyl), 6.61 (1H, m, 
furanyl), 6.41 (2H, br s, amine). 

3-(2-furyl)pyrazolo[3,4-d]pyrimidine-4-ylamine (32). 

Compound 31 (25.7g, 148 mmol) was stirred in 250 mL of formamide at 190 
°C for 4 h. The solution was then cooled to room temperature, diluted with 1 .2 L of water 
and chilled in an ice-bath. The solid that formed was filtered and dried: 22 g (74%) yield; 
TLC (5% methanol in ethyl actate), R t - = 0.25. 

[5-(4-Amino-3-(2-furyl)pyrazolo(3.4-d]pyrimidinylW 
oxoethyl]oxolan-2-yl] methyl 4-methylbenzoate (33). 

Compound 32 (10 g, 49.7 mmol) was stirred in 200 mL of 0.29 M methanolic 
KOH solution for 5 min. The mixture was evaporated to dryness and the residue was then 
dissolved in 40 mL of hot anhydrous DMF. The solution was cooled to room temperature 
and then diluted with 230 mL of anhydrous acetonitrile. The chlorosugar derivative (23 g, 
59.14 mmol) was added immediately and the mixture was stirred for 45 min and then 
evaporated to dryness. The residue was dissolved in 800 mL of ethyl acetate and washed 
with water (2 x 800 mL). The organic solution was dried over sodium sulfate, filtered and 
evaporated. The crude product was purified by silica gel chromatography eluting with a 
gradient of 30-0% hexane in ethyl acetate. The pure product fractions were evaporated and 
the residue was crystallized from hot methanol: 3.4 g (12%) yield; TLC (ethyl actate), R f = 
0.47; 'H NMR ((DMSO-4,) 5 8.27 (1H, s, 6-H), 7.96 (1H, m, furanyl), 7.95, 7.84, 7.36 & 
7.23 (8H, 4 x d, toluoyl aromatic), 6.83 (1H, m, furanyl), 6.80 (1H, t, J=6.3 Hz, l'-H), 6.70 
(1H, m, furanyl), 5.86 (1H, m, 3'-H), 4.64-4.42 (3H, m, 4'-H and 5'-Hs), 3.28 & 2.76 (2H, 2 
x m, 2'-Hs), 2.39 & 2.36 (6H, 2 x s, toluoyl methyls). 

5 .{4.l(lZ)-l-aza-2-(dimethylamino)vinyl]-3-(2-furyl)pym 

(hydroxymethyl)oxolan-3-ol (34). 

A suspension of 33 (3.36 g, 6.08 mmol) in 17 mL of 0.12 M methanolic 
sodium methoxide solution was refluxed for 30 min. The resulting solution was cooled to 
room temperature and neutralized by addition of 0.12 mL of acetic acid. The solution was 
evaporated to dryness and the product was precipitated from methanol-ether and then 
recrystallized from boiling water: 1 .63 g (85%) yield of the deprotected nucleoside. 
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All of the nucleoside product from above was stirred in a solution consisting 
of 30 mL of anhydrous DMF and 15 mL of N,N-dimethylformamide dimethylacetal for 5 h. 
The solution was evaporated to dryness and the residue was then evaporated twice from 
xylenes affording a foam: 1 .89 g (99%) yield; TLC (20% methanol in ethyl actate), R f = 
0.45; 'H NMR ((DMSO-rf<s) 6 8.98 & 8.46 (2H, 2 x s, formamidine C-H and 6-H), 7.97 
(1H, m, furanyl), 7.81 (1H, m, furanyl), 6.65 (2H, m, F-H and furanyl), 5.30 (1H, d, J = 4.5 
Hz, 3'-OH), 4.79 (1H, t, J = 5.5 Hz, 5'-OH), 4.51 (1H, m, 3'-H), 3.85 (1H, m, 4'-H), 3.57 & 
3.40 (2H, 2 x m, 5'-Hs), 3.26 & 3.21 (6H, 2 x s, N-Me protons), 2.88 & 2.30 (2H, 2 x m, 2'- 
Hs). 



3-[(5-{4-[(lZ)-l-aza-2-(methylamino)vinyl]-3-(2-furyl)pyrazolo[3,4-d] 

{[bb(4-methoxyphenyl)phenylmethoxy]methyl}oxolan-3-yl 

phosph inooxyjpropanenitrile (35). 

Dimethoxytrityl chloride (2.01 g) was added to a solution of 34 (1.84 g, 4.95 

15 mmol) dissolved in 30 mL of dry pyridine. The reaction solution was stirred for 3 h at room 
temperature and then poured into 200 mL of 5% sodium bicarbonate solution. The product 
was extracted with 300 mL of ethyl acetate and the organic phase was dried over sodium 
sulfate and evaporated. The residue was purified using silica gel chromatography eluting 
with a gradient of 0-5% methanol in ethyl acetate. The pure product fractions were 

20 evaporated affording a foam: 2.26 g (68%) yield of the 5'-0-DMT derivative. 

To a solution of the DMT derivative (2.18 g, 3.23 mmol) dissolved in 40 mL 
of dry methylene chloride, containing 1.75 mL of N,N-diisopropylethylamine, was added 
1.25 mL of 2-cyanoethyl diisopropylchlorophosphoramidite. The solution was stirred under 
argon for 15 min at room temperature and then treated with 5 mL of methanol. The resulting 

25 solution was diluted with 500 mL of ethyl acetate and washed with 400 mL of 5% sodium 
bicarbonate solution. The organic solution was dried over sodium sulfate, filtered and 
evaporated. The residue was purified by silica gel chromatography eluting with a gradient of 
5-10 % methanol in ethyl acetate (2% triethylamine). The pure product fractions were 
evaporated affording a foam: 1.62 g (57%) yield; 31 P NMR (DMSO-rf,*) 8 147.81 and 

30 147.16. 
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Example 7 



This example illustrates the preparation of 3-[(5-{6-[(lE)-l-aza-2- 
(dimethylamino)vinyl]-4-[(lZ)-l-aza-2-(dimethylamino)vinyl]-3-(2-furyl)pyrazolo[3,4- 
5 d]pyrimidinyl}-2-{[bis(4-methoxyphenyl)phenylmethoxy]methyl}oxolan-3- 

yloxy)[bis(methylethyl)amino]phosphinooxy]propanenitrile (39, See Reaction Scheme 9) 

3-(2-furyl)pyrazolo[3,4-d]pyrimidine-4,6-diamine (36) 

A mixture of finely ground 31 (10.0 g, 57.47 mmol) and guanidine carbonate 

10 (16.6 g, 91 .95 mmol) was heated at 230 °C for 45 min. The mixture was cooled to room 

temperature and the solid was triturated in 100 mL of boiling water. The solid (pure 36) was 
filtered, rinsed with water and dried: 1 1.1 g (89%) yield; TLC (40% methanol in ethyl 
actate), R f =0.66; 'H NMR ((DMSO-^) 5 12.68 (1H, br s, N-H), 7.86 (1H, m, furanyl), 
6.93 (2H, br s, -NH 2 ), 6.85 (1H, d, J = 3.5 Hz, furanyl), 6.65 (1H, m, furanyl), 6.10 (2H, br s, 

15 -NH 2 ). 

5-(4,6-diamino-3-(2-furyl)pyrazolo[3,4-d]pyrimidinyl)-2-(hydroxymethyl)oxolan-3 (37) 

Compound 36 (10.5 g, 48.61 mmol) was stirred in 200 mL of 0.29 M 
methanolic KOH solution for 5 min. The mixture was evaporated to dryness and the residue 

20 was then dissolved in 105 mL of hot anhydrous DMF. The solution was cooled to room 
temperature and then diluted with 620 mL of anhydrous acetonitrile. The chlorosugar 
derivative (23 g, 59.14 mmol) was added immediately and the mixture was stirred for 40 min 
and then filtered. The filtrate was evaporated to dryness and the residue was 
chromatographed through a silica gel column eluting with ethyl acetate. The nucleoside 

25 fractions were evaporated affording 2.8 g (10 % yield) of the toluoyl protected nucleoside as 
a mixture of alpha- and beta-anomers. This mixture was refluxed in 40 mL of 0.19 M 
methanolic sodium methoxide for 45 min. The reaction solution was placed in a freezer 
overnight producing a crop of crystals corresponding to the pure beta-anomer 37: 690 mg 
yield; TLC (20% methanol in ethyl actate), R f = 0.32; 'H NMR (DMSO-4j) 8 7.90 (lH.m, 

30 furanyl), 6.99 (2H, br s, amino), 6.86 (1H, d, J = 4.1 Hz, furanyl), 6.68 (1H, m, furanyl), 6.41 
(IH, t, J = 6.6 Hz, 1 '-H), 6.26 (2H, br s, amino), 5.21 (1H, d, J = 4.4 Hz, 3'-OH), 4.82 (1H, t, 
J = 5.8 Hz, 5'-OH), 4.42 (1H, m, 3'-H), 3.79 (1H, m, 4'-H), 3.52 & 3.41 (2H, 2 x m, 5'-Hs), 
2.75 & 2.18 (2H, 2 x m, 2'-Hs). 
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5-{6-[(lE)-l-aza-2-(dimethylamino)vinyl]-4-[(lZ)-l-aza-2-(dimethylam^ 
furyl)pyrazolo[3, 4-d]pyrimidinyl}-2-(hydroxymethyl)oxolan-3-ol (38). 

A solution of 37 (0.68 g, 2.05 mmol) in 30 mL of anhydrous DMF and 15 mL 
of N,N-dimethylformamide dimethylacetal was stirred at room temperature for 24 h. The 
solution was evaporated to dryness and the residue was evaporated twice from xylenes 
affording a foam: 0.90 g (99%) yield; TLC (50 % methanol in ethyl actate), R f = 0.38; *H 
NMR (DMSO-tf 6 ) 5 8.93 & 8.77 (2H, 2 x s, formamidine C-Hs), 7.94 (1H, m, furanyl), 
7.76 (1H, m, furanyl), 6.62 (1H, m, furanyl), 6.60 (1H, t, J = 6.6 Hz, 1 '-H), 5.25 (1H, d, J = 
4.4 Hz, 3'-OH), 4.84 (1H, t, J = 5.9 Hz, 5'-OH), 4.47 (1H, m, 3'-H), 3.83 (1H, m, 4'-H), 3.56 
& 3.41 (2H, 2 x m, 5*-Hs), 3.25, 3.18, 3.16 & 3.03 (12H, 4 x s, N-methyls), 2.82 & 2.22 (2H, 
2 x m, 2'-Hs).] 

3-[(5-{6-[(lE)-l-aza-2-(dimethylamino)vinyl]-4-[(lZ)-l-aza-2^ 
furyl)pyrazolof3,4-d]pyrimidinyl}-2-{fbis(4-methoxyphenyl)phen 
yloxy)[bis(methylethyl) amino] phosphinooxyjpropanenitrile (39). 

Dimethoxytrityl chloride (0.85 g) was added to a solution of 38 (0.90 g, 2.04 
mmol) in 12 mL of dry pyridine. The reaction solution was stirred for 2 h at room 
temperature and then poured into 200 mL of 5% sodium bicarbonate solution. The product 
was extracted with ethyl acetate (3 x 300 mL) and the pooled organic extracts were dried over 
sodium sulfate and evaporated. The residue was purified using silica gel chromatography 
eluting with a gradient of 20-30% methanol in ethyl acetate. The pure product fractions were 
evaporated affording a foam: 286-mg yield of the 5'-0-DMT derivative. 

To a solution of the DMT derivative (286 mg, 0.384 mmol) dissolved in 5 mL 
of dry methylene chloride, containing 0.23 mL of N,N-diisopropylethylamine, was added 
0.17 mL of 2-cyanoethyl diisopropylchlorophosphoramidite. The solution was stirred under 
argon for 15 min at room temperature and then treated with 0.5 mL of methanol. The 
resulting solution was diluted with 100 mL of ethyl acetate and washed with 75 mL of 5% 
sodium bicarbonate solution. The organic solution was dried over sodium sulfate, filtered 
and evaporated. The residue was purified by silica gel chromatography eluting with a 
gradient of 0-30 % methanol in ethyl acetate (2% triethylamine). The pure product fractions 
were evaporated affording a foam: 230 mg (12 %) yield; 3, P NMR (DMSO-^) 6 147.77 
and 147.08. 
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Example 8 



This example illustrates the preparation of 4-[l-(5-{[bis(4- 
5 methoxyphenyl)phenylmethoxy]methyl} -4- {[bis(methylethyl)amino](2- 

cyanoethoxy)phosphinooxy}oxolan-2-yl)-2,4-dioxo-13-dihydropyrimidin-5-yl]but-3-ynyl 4- 

methylbenzoate (see, 42 Reaction Scheme 10) 

Synthesis of 3-Butyn-l-yl p-Toluate (40) 

10 In a 500 mL round bottom flask equipped with a magnetic stirring bar and 

under inert atmosphere, 3-butyn-l-ol (50.3 g, 0.718 mol) was dissolved with anhydrous 
pyridine (200 mL) and the solution was cooled in an ice water bath. To the cold solution, 
using an addition funnel, ^-toluyl chloride (136.6 g, 0.86 mol, 1.2 eq) was added dropwise 
and the reaction mixture was stirred at room temperature overnight. 

15 To the mixture were added diethyl ether (350 mL) and water (100 mL). The 

organic layer was separated, and the aqueous solution was washed with diethyl ether (150 
mL). The organic fractions were combined and washed with 10% HC1 (3 x 100 mL), 
saturated NaHC0 3 solution (2 x 100 mL) and water (1 x 50 mL). The resulting solution was 
dried over anhydrous Na 2 S0 4 , filtered and the solvent was removed to give 142 g 

20 (quantitative yield) of (40) as an off-white solid. The product can be recrystallized form 
hexane or methanol, but was sufficiently pure to proceed to the next step. 

'H NMR (CDC1 3 , 300 MHz) 5 (ppm) 7.95 (d, 2H, J = 8.1 Hz), 7.24 (d, 2H, / 
= 8.1 Hz), 4.41 (t, 2H, J = 6.8 Hz), 2.67 (dd, 2H, J, =2.6 J 2 = 6.8 Hz), 2.40 (s, 3H), 2.03 (t, 
lH,y=2.4 Hz). 

25 

5 '-DMT-5-[4-(p-toluyloxy)butynyl]-2 '-deoxyurdine (41) 

A mixture of 5-iodo-2'-deoxyuridine (4.0 g, 1 1.30 mmol), 4-(p- 

toluyloxy)butyne (40) (5.7 g, 30.3 mmol), Cul (222 mg, 1.16 mmol), 

tetrakis[triphenylphosphine] P alladium[0] (0.67 g, 0.583 mmol) and triethylamine (2.0 mL) 
30 was stirred in 30 mL of anhydrous DMF under argon for 1 6 h. The mixture was evaporated 

to dryness and the residual oil solidified after stirring in 100 mL of ether. This solid was 

filtered and dried under vacuum. 
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A portion of the crude product (1.5 g) was dissolved in 20 mL of anhyrdrous 
pyridine and treated with dimethoxytrityl chloride (1.3 g). The solution was stirred for 2.0 h. 
under argon and then poured into 150 mL of 5% sodium bicarbonate solution. The product 
was extracted with 200 mL of ethyl acetate. The extract was dried over sodium sulfate, 
filtered and the solvent evaporated. The residue was purified by silica gel chromatography 
eluting with a gradient of 30-0% hexane in ethyl acetate). The product fractions were 
evaporated, affording a foam: 957-mg yield. TLC (30% hexane in ethyl acetate), R/ = 0.37; 
! H NMR (DMSO-d 6 ) 5 1 1.66 (1H, s, N-H), 7.9 - 6.8 (18H, aromatic protons), 6.1 1 (1H, t, J 
= 6.6 Hz, T-H), 5.35 (1H, br s, 3'-OH), 4.29 (1H, m, 3'-H), 4.15 (2H, t, J= 6.8 Hz, butyne 
methylene), 3.92 (1H, m, 4'-H), 3.71 (6H, s, methoxy groups), 3.24 & 3.07 (2H, 2 x m, 5'- 
Hs), 2.64 (2H, t, J = 6.8 Hz, butyne methylene), 2.37 (3H, s, toluyl methyl), 2.23 (2H, m, 2'- 
Hs). 

4'[l'(5-{[bis(4-methoxyphenyl)phenylmethoxy]methyl}-4- 
{[bis(methylethyl)amino](2-cyanoethoxy)phosphinooxy}oxolan-2-yl)-2,4- 
dioxo-l t 3-dihydropyrimidin-5-yl]but-3-ynyl 4-methylbenzoate (42) 
To a solution of the DMT derivative from above (0.92 g, 1.28 mmol) in 15 mL 
of anhydrous methylene chloride, containing 0.75 mL of N t N-diisopropylethylamine, was 
added 2-cyanoethyl diisopropylchlorophosphoramidite (0.56 mL). The solution was stirred 
for 30 min at room temperature under argon and then treated with 1.0 mL of methanol. The 
solution was evaporated down to about a 5 mL volume and loaded directly on to a silica gel 
column and eluted with 40% hexane in ethyl acetate (2% triethylamine). The pure fractions 
were pooled and evaporated, affording a foam: 0.91 g of (42) (78%) yield; 31 P NMR 
(DMSO-d 6 ) 5 147.72 & 147.39. 

Example 9 

This example illustrates the enhanced ability of modified oligonucleotides to 
discriminate between related target sequences. 

In this example, modified oligonucleotides were prepared containing PPPA, 
PPG and a minor groove binder. As Figure 3 illustrates, the use of a MGB in combination 
with PPPA and PPG increases the T m and allows the design of shorter probes compatible 
with PCR extension temperatures. These modified oligonucleotides also yield increased 
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mismatch discrimination. In a first design of a 18-mer fluorescein-ODN-Red 13quencher- 
MGB real-time PCR probe, poor discrimination of an A/T mismatch was observed as shown 
in Figure 3 A. Re-design of the probe against the opposite strand, putting the mismatch under 

5 the MGB and substituting PPPA for A and PPG for G as indicated, allows shortening of the 
probe to a 15-mer. This probe now gives good mismatch discrimination as shown in Figure 
3B. 

A thermodynamic investigation of mismatch discrimination was performed on 
a set of oligonucleotides hybridized to a set of targets perfectly match or containing a single 
10 mismatch. The target sequences contain a) normal As, b) PPPA, c) normal As and a 3'-MGB 
and d) PPPA and a 3' -MGB, respectively. The sequences for the probes and targets are 
shown below in Table 3a and 3b. The determination of T m s and the calculation of 



Table 3a and 3b 

15 Probe and target sequences 



A. Probe Sequences - Mismatch Underlined 


Number 


Mismatch 


Probe Sequence 


1 


Complement 


AAAGTTATGTCTACTTACAGAAA 


2 


AJC 


AAAGCTATGTCTACTTACAGAAA 


3 


AJC 


AAAGTCATGTCTACTTACAGAAA 


4 


T/G 


AAAGTTGTGTCTACTTACAGAAA 


5 


AJC 


AAAGTTACGTCTACTTACAGAAA 


6 


C/A 


AAAGTTATATCTACTTACAGAAA 


7 


AJC 


AAAGTTATGCCTACTTACAGAAA 


8 


G/T 


AAAGTTATGTTTACTTACAGAAA 


9 


AJC 


AAAGTTATGTCCACTTACAGAAA 


10 


T/G 


AAAGTTATGTCTGCTTACAGAAA 


11 


G/T 


AAAGTTATGTCTATTTACAGAAA 


12 


AJC 


AAAGTTATGTCTACCTACAGAAA 


13 


AJC 


AAAGTTATGTCTACTCACAGAAA 


14 


T/G 


AAAGTTATGTCTACTTGCAGAAA 


B. Target Sequences - A'=PPPA 


1 


GTAAGTAGACATAAC 


2 


GTA'A'GTA'GA'CA'TA'A'C 


3 


GTAAGTAGACATAAC-MGB 


4 


GTA'A'GTA'GA'CA'TA'A'C-MGB 
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Table 3c 



Thermodynamic comparison of mismatch discrimination in terms of free energy 
5 increment at 50°C where AAG%=R 0 ln(K ma tch/K m i Sma tch) 



Number 


Mismatch 


A 


PPPA 


MGB 


MGB+PPPA 


2 


A/C 


2340 


2930 


OQ7H 




3 


A/C 


2560 


3280 


4100 


6320 


4 


T/G 


1950 


1810 


4200 


5900 


5 


A/C 


3520 


3760 


3830 


4980 


6 


C/A 


5030 


5340 


4190 


5970 


7 


A/C 


3000 


3370 


4310 


5260 


8 


G/T 


3040 


3260 


3070 


4820 


9 


A/C 


3290 


3440 


3810 


5630 


10 


T/G 


1800 


1950 


2090 


3350 


11 


G/T 


3340 


3120 


3630 


5070 


12 


A/C 


2940 


3620 


2550 


4490 


13 


A/C 


2360 


3210 


1820 


3980 


14 


T/G 


1600 


2010 


2000 


2480 



AAG°5o is described in Example 9. Table 3c clearly shows increased mismatch 
10 discrimination when PPPA is substituted for A and even larger discrimination when PPPA is 
combined with a MGB. 

Comparison of the thermodynamic discrimination of mismatched base pairs 
formed in MGB-ODNs containing HO-PPPA/HO-PU with PPPA/PU at 37°C is shown in 
Table 4. The ODNs containing the modified bases in combination with MGB were 
15 hybridized to their complements. Mismatches are underlined in the sequences shown in 

Table 4. As shown in this table, HO-PPPA and HO-PU substitution compared to PPPA and 
PU, shown for the most part increased mismatch discrimination. 
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Table 4 



Comparison of thermodynamic discrimination of mismatched base pairs formed by 
5 HOPPPA or HOPU vs PPPA and PU in the 8-mer duplexes (+MGB). 





Sequence of Duplex 


PPPA/ 

r U 


HOPPPA/ 

nUr U 




Sequence of 


PPPA/ 
PU 


HOPPPA/ 

HO PI I 






AAG 
cal/mol 


AAG 
cal/mol 






AAG 

m i/m a! 


AAG 

rn |/mnl 


Match 


. CGUCACUG-MGB 
AGCTGTGACT 






Match 


UAUUAUUG-MGB 
AATAATAACC 






I 


CGUCACUG-MGB 
AGCTGTGACT 


4250 


4350 


10 


UAUUAUUG-MGB 
AATTATAACC 


4400 


5000 


2 


CGUCACUG-MGB 
AGCGGTGACT 


3450 


3540 


11 


UAUUAUUG-MGB 
AATGATAACC 


3740 


3760 


3 


CGUCACUG-MGB 
AGCCGTGACT 


4860 


4530 


12 


UAUUAUUG-MGB 
AATCATAACC 


6630 


6840 


4 


CGUCACUG-MGB 
AGCAGAGACT 


4870 


4850 


13 


UAUUAUUG-MGB 
AATAAAAACC 


5090 


5730 


5 


CGUCACUG-MGB 
AGCAGGGACT 


4190 


4360 


14 


UAUUAUUG-MGB 
AATAAGAACC 


5920 


6520 


6 


CGUCACUG-MGB 
AGCAGCGACT 


3930 


3940 


15 


UAUUAUUG-MGB 
AATAACAACC 


4120 


4530 


7 


CGUCACUG-MGB 
AGCAATGACT 


2600 


2300 










8 


CGUCACUG-MGB 
AGCATTGACT 


4360 


4210 










9 


CGUCACUG-MGB 
AGCACTGACT 


4420 


4610 











AAG was calculated at 37°C. 



Example 10 

10 

This example illustrates the UV melting studies carried out using 
oligonucleotides of the present invention. 

Hybrids formed between unmodified ODNs or MGB-probes and their 
complements were melted at a rate of 0.5°C/min, on a Lambda 2S (Perkin-Elmer) 
1 5 spectrophotometer with a PTP-6 automatic multicell temperature programmer. T m data was 
prepared using 0.5x SSPE buffer (Sigma, pH 7.4). Each ODN (luM of each strand) was 
mixed with its complement to give a 1 : 1 ratio. Prior to melting, samples were denatured at 
100°C and then cooled to 10°C over a 10 min period. Mismatch discrimination for each type 
of duplex was calculated in terms of AAG at 50°C using the equation: 
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AAG°50 — R°ln(Kmatch/K-mismatch)- 



The term K match /K mismaI ch can be determined using the relative fractions of 
5 duplex and single strands calculated from the melting curves at 50°C (see Lohkov, S.G. & 
Pyshnyi, FEBS Letters 420:134-138 (1997)). 



Example 11 



10 This example illustrates the hybridization of DNA to oligonucleotide glass 

microarrays prepared as described in co-pending application Ser. No. 09/364,320 and further 
optimized for 8-10 mer MGB probes. 

DNA (oligonucleotide or PCR amplicon) at 1-5 x 10" 7 M in 5XSSPE, 0.1% 
Trition X-100, 10% formamide was hybridized to the microarray in Frame-Seal Chambers 

15 (MJ Research) under the following conditions: 5 minutes at 55°C, slow cooling at 0.1 °C/sec 
to 35 °C, 60 minutes at 55°C. The slides were then washed in 0.5XSSPE for 5 minutes at 
45°C. Slides were dried under a stream of air and scanned using an Array Works Fluorescent 
scanner (Applied Precision). The washing procedure can be repeated under more stringent 
conditions if required. 

20 

Example 12 



This example illustrates a single base primer extension assay. 
Single stranded template DNA and 6-mer primer (20p.M each), IX Thermopol 
25 buffer (New England Biolabs), lOmM manganese chloride (USB), were combined and 

incubated 5 minutes at 50°C. Then 5 p.Ci cc 32 P ddATP and 8 units Bst polymerase (NEB) 
were added, bringing total volume to lOul, and incubated 15 minutes at 50°C. After 
incubation, 6ul stop solution (95% formamide, 20mM EDTA) was added and reactions were 
cooled to room temperature. Samples were diluted 1:10 in denaturing dye (35% formamide, 
30 0.05% xylene cyanol, 0.05% bromophenol blue, 1 mM EDTA), and aliquots were analyzed by 
electrophoresis on a denaturing 10% polyacrylamide gel. 
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The use of modified bases in primer extension assays for SNP typing is 
illustrated in Table 5. Primer extension with Bst polymerase (NEB) was evaluated with a 
primer substituted with different modified bases and the amount of product was measured 
based on the incorporated radioactivity using gel electrophoresis. 

5 

Table 5 

Comparison of 32 P-incorporation in primer extension product by polyacrylamide 



gel electrophoresis using AAC CAC TCT GTC CTA template 



Primer 


Relative Signal 


TGAGAC 


-H- 


U^GAGAC 


+ 


LT^G^AGAC 


-w-f 


jjpropQprop^ropQ^ 


4H-h+-h 


jjpropQprop^propQpropj^Q 


11111 


jjpropQpropj^propQprop^prop q 


-h 


jjpropQprop ppropQprop ppropQprop 


0.5+ 



Example 13 



This example illustrates the use of the algorithm described in the specification 
to predict the T m of modified oligonucleotides containing PPG both with and without a minor 

1 5 groove binder (CDPI 3 ) 

Using thermodynamic parameters in Tables 2a/2b and the correction formula 
for the MGBs contribution, T m of PPG oligonucleotides with and without a MGB can be 
calculated with algorithm procedures as described above. The thermodynamic parameters of 
the nearest neighbor pairs that are not G-containing do not change when the oligonucleotides 

20 are substituted with PPG. Similarly the thermodynamic parameters of the nearest-neighbor 
pairs do not change when MGB containing oligonucleotides are substituted with PPG. 
Table 6 illustrates the ability of the algorithm to predict the T m of oligonucleotides containing 
PPG alone or in combination with a minor groove binder (CDPI3). In Table 6, all Gs are 
substituted with PPGs. 

25 
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Table 6 

Comparison of experimental T m s with that of predicted T m s using the nearest-neighbor 
thermodynamic parameters for PPG containing oligonucleotides and PPG containing 

oligonucleotides attached to a MGB 



S'-Probe Sequence -3' 


ODN Duplex Stability °C 


MGB-ODN Duplex Stability 
°C 


Tm 
1 1 1 'exp 


Tm . 
i • • 'calc 


Err 


Tin _ v ._ 

1 * ' * exp 


Tm r ,i, 

• ' ' 'calc 


Err 


CTGTAAGTAGATATAAC 


51 .84 


53 23 


1 .39 


65.88 


66.69 


0.81 


GGCAAGATATATAG 


so. 91 


4Q ft1 


-0.40 


66.37 


65.56 


-0.81 


GTGACGCAGATTCC 


fi1 97 




-0 21 


76.97 


75.19 


-1.78 


GTAAGTAGACATAAC 


S9 1 9 


fi1 7ft 
J l.i o 


-n ^4 


64.64 


63.31 


-1 .33 


CAGGGAGCTTTGGA 


SQ Q 


fiO 99 

\J\J .C-Z- 


0.32 


74.39 


71 .47 


-2.92 


CACTCGTGAAGCTG 


fin as 


fiQ AO 


-1 


74.04 


72.26 


-1 .78 


GTAAGTAGGCATAAC 


fiS 74 
DO. I f 


fifi A7 


-0 97 


66.91 


66.00 


-0.91 


CCGGATGTAGGATC 


S7 S9 




1 .53 


69.3 


70.03 


0.73 


GATTACCTGGATTT 


fi4 


fin 19 


-0.32 


62.29 


62.33 


0.04 


CCGTCAATGGTCAC 


Sfl fifi 
OO.DD 


fin m 


1 .35 


70.13 


69.91 


-0.22 


CAGCACGTAGCC 


S7 ^1 


fift 07 


0.76 


69.29 


67.60 


-1.69 


CGGCTACGTGCTGG 


fiS 1 Q 


fifi 01 


0.82 


76.12 


74.79 


-1.33 


CGGCTACATGCTGG 


fi*1 i 4 


fi1 Qfi 


n fii 


71 56 


72.99 


1 .43 


CTAAATCTGCCG 


fin A 


4R HQ 


-9 11 


62.08 


60.19 


-1.89 


TCTGGATGATGGGCA 


fi*1 74 


fi1 Qfi 


n 91 


71 65 


72 13 


0.48 


GTTCATGGGTGTAAT 


fi7 fii 
D/.O 1 


fi7 77 


0 9fi 


fifi 94. 


68.79 


1 .85 


CGGAGGTAGGATCA 


fiQ 04 


CQ /1C 

oy .*f o 


n 99 


fiQ 46 


70.93 


1.47 


CCACCCGCCTCAG 


fin 71 
ou. / o 


fi1 14 

D I. IH 


n 41 


71 41 


70.74 


-0.69 


CACAGGAGTGGTTGG 


fil H7 


fizi 4n 


1 11 


72.28 


72.92 


0.64 


CGGACCAGTGCGTG 


fift 1 
DO. I 


fi7 fift 




77.92 


76.80 


-1.12 


TCGGAC C AGTGCGT 


fifi n4 

UJ.U4 


fifi nn 


n Qfi 


74.94 


75.62 


0.68 


AACGGGGTACGATA 


fi7 Q1 


fi7 1 1 


-0 ft9 


67.79 


67.08 


-0.71 


CAGTTGAGATTCTAAGAC 


fin nfi 


fin 1 fi 


n hp 


67.1 5 


67.43 


0.28 


AGGGGCGTCTTG 


fin 7p 


fift fi7 


-9 91 


71 62 


72.76 


1.14 


GTAAGTAGGCATAGC 


58.34 


58.95 


0.61 


65.95 


66.99 


1.04 


TGCCCAGCCCCAG 


63.13 


63.40 


0.27 


71.28 


71.32 


0.04 


LLAAlAL 1 CCj 1 bAA 


54.87 


56.14 


1.27 


62.07 


63.54 


1.47 


GTAAGTAGACACAGC 


59.48 


58.41 


-1.07 


65.79 


66.27 


0.48 


TCGGACCAGTGC 


I 58.02 


58.55 


0.53 


65.99 


66.35 


0.36 


CGATCACGCTGGC 


62.12 


62.75 


0.63 


69.18 


71.81 


2.63 


GTCCTGGGGGTGG 


65.19 


64.54 


-0.65 


72.78 


72.53 


-0.25 


GTAAGTAGGTGTGAC 


60.7 


59.70 


-1.00 


66.92 


67.00 


0.08 


GGTTGTACGGGTTCACG 


68.38 


68.81 


0.43 


74.16 


75.38 


1.22 


GGACCAGTGCGTGA 


66.84 


65.46 


-1.38 


73.38 


71.53 


-1.85 


GTAAGTAGACGCAGC 


62.91 


62.44 


-0.47 


68 


67.82 


-0.18 


GTAAGTAGGCGCAGC 


65.52 


65.91 


0.39 


69.8 


70.34 


0.54 


GTAAGTAGGCGCGGC 


68.71 


68.96 


0.25 


72.26 


72.76 


0.50 


GGTTCCCGAGCG 


| 62.15 


61.14 


-1.01 


65.75 


64.22 


-1.53 
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The accuracy of the prediction algorithm is about +/- 1 and +/- 2°C for PPG 
containing oligonucleotides and PPG containing MGB-oligonucleotides, respectively. 
Accordingly, a sequence of interest can be obtained from a source such as Genbank, and a T m 
window can then be set as a requirement for a set of probes or primers. Using the algorithm 
5 above, and information from the nearest-neighbor parameters, a collection of probe or primer 
sequences with the desired T m s can be calculated. 

Alternatively, the algorithm can be used to select nearest-neighbor parameters 
from a selection of modified bases and to calculate the stability of more than one sequence of 
the same length to give the same predetermined thermodynamic stability, with the selected 
10 modified bases. 

Still further, the algorithm can be used to select one or more modified bases 
that allow the design of oligonucleotides of the same base pair length and substantially the 
same stability (T m ). These modified bases can be selected from a database containing pre- 
determined thermodynamic nearest neighbor parameters from an collection of modified 
15 purines and/or modified pyrimidines. Preferably, the database contains nearest-neighbor 
parameters of 3-substituted pyrazolo[3,4-d]pyrimidines and/or 5-substituted pyrimidines. 

Example 14 

20 This example illustrates the use of modified oligonucleotides in an Invader™ 

assay. 

The cleavase-based assay is shown in Figure 4 detects specific DNA and RNA 
sequences to cleave a complex formed by the hybridization of two overlapping 
oligonucleotides to a target. The enzyme cleaves the overhanging "flap", which serves as an 
25 invader probe in the detection cassette probe where cleavage releases a fluorescence signal. 
The first cleavage takes places only when the single base mismatch in the invader is a perfect 
match. No reaction takes place with a mismatch target B. The cleaved "flap" serves as an 
invader in the detection cassette, leading to the release of fluorescence in the second cleavage 
step. 
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Table 7a and 7b 

Comparison of Cleavase-bases amplification system with different modified bases substituted 

in the genomic and invader probes. 

5 

Table 7a 



# 


Probe 1 


Modified Bases Substituted in Probe Sequence 2 


Length 


1 


Cassette 


None 


41 


2 


Invader T 


None 


59 


3 


Invader Ml 


Five A24 bases 


32 


4 


Genomic T 


None 


43 ! 


5 


Genomic Ml 


Six A14 bases 


35 


6 


Genomic M2 


Five A24 bases 


31 



1 Unmodified Probes and assay conditions are similar to those described by Hall et al, PNAS 
10 97:8272-77 (2000) 

2 A14 is hydroxypropynylPPA and A24 is 3-iododiaminoPPA 



Table 7b 



Genomic probe 


Endpoint Match/Mismatch 
Signal Ratio 


AF RFU J 


4 


5 


4100 


5 


10 


8000 


6 


7 


11900 


i AF is end point fluorescence difference between match and 



15 mismatch in relative fluorescence units 

Table 7a shows a comparison of the effect of different modified bases when 
substituted in the Invader and genomic probes. The traditional invaderT (2) was substituted 
with 3-iododiaminoPPPA to give invader Ml which now has a length of 32 with a similar T m 

20 as the 59-mer (2). Similarly the traditional genomic probe (4) was substituted 3- 

hydroxypropynylPPPA and 3-iododiaminoPPPA to give (5) and (6) respectively. Both of 
these probes are substantially shorter than the traditional (4). Using invaderMl (3) in 
combination with the genomic probes 4-6 individually results in (Table 7b) improved 
performance of both genomic probes containing the modified bases (5 and 6) compared to the 
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traditional genomic probe (4). As shown both the endpoint match/mismatch signal ratios 
and the end point fluorescence show increases compared to the unmodified genomic probe. 

All publications and patent applications cited in this specification are herein 
5 incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. Although the 
foregoing invention has been described in some detail by way of illustration and example for 
purposes of clarity of understanding, it will be readily apparent to those of ordinary skill in 
the art in light of the teachings of this invention that certain changes and modifications may 
10 be made thereto without departing from the spirit or scope of the appended claims. 
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